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Replication-Competent Anti-Cancer Vectors 
Reference to Government Grant 

This invention was made with government support under a grant from the National 
Institutes of Health, Grant Number ROl CA71704 and CA81829. The United States 
Government has certain rights in this invention. 
Background of the Invention 

(1) Field of the Invention 

This invention relates generally to the treatment of cancer and more particularly to 
vectors which replicate in neoplastic cells and which overexpress an adenovirus death protein 
(ADP) and to the use of these vectors in treating human cancer. 

(2) Description of the Related Art 

Cancer is a leading cause of death in the United States and elsewhere. Depending on 
the type of cancer, it is typically treated with surgery, chemotherapy, and/or radiation. These 
treatments often fail: surgery may not remove all the cancer; some cancers are resistant to 
chemotherapy and radiation therapy; and chemotherapy-resistant tumors frequently develop. 
New therapies are necessary, to be used alone or in combination with classical techniques. 

One potential therapy under active investigation is treating tumors with recombinant 
viral vectors expressing anti-cancer therapeutic proteins. Adenovinis-based vectors contain 
several characteristics that make them conceptually appealing for use in treating cancer, as 
well as for therapy of genetic disorders. Adenovinises (hereinafter used interchangeably with 
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"Ads") can easily be grown in culture to high titer stocks that are stable. They have a broad 
host range, replicating in most human cancer cell types. Their genome can be manipulated by 
site-directed mutation and insertion of foreign genes expressed from foreign promoters. 

The adenovirion consists of a DNA-protein core within a protein capsid (reviewed by 
5 Stewart et al., "Adenovirus structure by x-ray crystallography and electron microscopy." in: 
The Molecular Repertoire of Adenoviruses^ Doerfler, W. et al., (ed).. Springer- Verlag, 
Heidelberg, Germany, p. 25-38). Virions bind to a specific cellular receptor, are endocytosed, 
and the genome is extruded from endosomes and transported to the nucleus. The genome is a 
linear duplex DNA of about 36 kbp, encoding about 36 genes (Fig. lA). In the nucleus, the 

1 0 "inmiediate early*' E 1 A proteins are expressed initially, and these proteins induce expression 
of the "delayed early" proteins encoded by the ElB, E2, E3, and E4 transcription units 
(reviewed by Shenk, T. "Adenoviridae: the viruses and their replication" in: Fields Virology^ 
Field, B.N. et al., Lippencott-Raven, Philadelphia, p. 21 1 1-2148). El A proteins also induce 
or repress cellular genes, resulting in stimulation of the cell cycle. About 23 early proteins 

1 5 function to usurp the cell and initiate viral DNA replication. Viral DNA replicates at about 7 
h post-infection (p.i.), then late genes are expressed from the "major late" transcription unit. 
Major late mRNAs arc synthesized from the common "major late promoter" by alternative 
pre-mRNA processing. Each late mRNA contains a conmion "tripartite leader" at its 5'- 
terminus (exons 1, 2, and 3 in Fig. 1), which allows for efficient translation of Ad late 

20 mRNAs. Cellular protein synthesis is shut off, and the cell becomes a factory for making 
viral proteins. Virions assemble in the nucleus at about 1 day p.i., and after 2-3 days the cell 
lyses and releases progeny virus. Cell lysis is mediated by the E3 1 1,6K protein, which has 
been renamed "adenovirus death protein" (ADP) (Tollefson et al., J. Virol. 70:2296-2306, 
1996; Tollefson et al., Virol. 220:152-162, 1996). The term ADP as used herein in a generic 

25 sense refers collectively to ADFs from adenoviruses such as, e.g. Ad type 1 (Adl), Ad type 2 
(Ad2), Ad type 5 (Ad5) or Ad type 6 (Ad6) all of which express homologous ADP*s with a 
high degree of sequence similarity. 

Human adenovirus type 5 (Ad5) is particularly usefiil for cancer gene therapy. It 
primarily causes asymptomatic or mild respiratory infections in yoxmg children, followed by 

30 long term effective immunity. Fatalities are extremely rare except when the patient is 

immunocompromised (Horwitz, M. S., Adenoviruses, p. 2149-2171 In B. N. Fields, D, M. 
Kntpe, and P. M. Howley (eds.). Fields Virology, Lippincott-Raven Publishers, Philadelphia, 
PA, 1996). AdS is very well understood, can be grown in culture to high titer stocks that are 
stable, and can replicate in most human cancer cell types (Shenk, T., Adenoviridae: the 

35 viruses and their replication, p. 21 1 1-2148. In B. N. Fields, D. M. Knipe, and P. M. Howley 
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(eds.)) Fields Virology, Lippincott-Raven, Philadelphia, 1996). Its genome can be 
manipulated by site-directed mutagenesis and insertion of foreign sequences. 

The Ad vectors being investigated for use in anti-cancer and gene therapy are based 
on recombinant Ad's that are either replication-defective or replication-competent. Typical 
5 replication-defective Ad vectors lack the El A and ElB genes (collectively known as El) and 
contain in their place an expression cassette consisting of a promoter and pre-mRNA 
processing signals which drive expression of a foreign gene. The El A proteins induce 
transcription of other Ad genes, and in nontransformed cells they deregulate the cell cycle, 
induce or repress a variety of cellular genes, and force cells from Go into S-phase 48 (White, 

10 E., Semin. ViroL 5:505-513. 1998; Wold et aL, pp. 200-232 In AJ. Cann (ed), DNA Virus 
Replication: Frontiers in Molecular Biology, Oxford University Press, Oxford). The ElB 
proteins inhibit cellular apoptosis. Id. These vectors are unable to replicate because they lack 
the El A genes required to induce Ad gene expression and DNA replication. In addition, the 
E3 genes are usually deleted because they are not essential for virus replication in cultured 

15 cells. 

A number of investigators have constructed replication-defective Ad vectors 
expressing anti-cancer therapeutic proteins. Usually, these vectors have been tested by direct 
injection of human tumors growing in mouse models. Most commonly, these vectors express 
the thymidine kinase gene from herpes simplex virus, and the mice are treated with 

20 gancyclovir to kill cells transduced by the vector (see e.g., Felzmann et ah. Gene Then 

^: 1322-1329, 1997). Another suicide gene therapy approach involves injecting tumors with a 
replication defective Ad vector expressing cytosine deaminase, followed by administration of 
5-fluorocytosine (Topf et al. Gene Then 5:507-513, 1998). Investigators have also prepared 
and tested replication-defective Ad vectors expressing a cytokine-such as IL-2, IL-12, IL-6, 

25 tumor necrosis fector (TNF)» type I interferons, or the co-stimulatory molecule B7-1 in the 
anticipation that the Ad-expressed cytokine will stimulate an inunune response, including 
cytotoxic T-lymphocytes (CTL), against the tumor (Felzmann et al., supra; Putzer et al.. Proa 
Natl. Acad. Sci. USA P¥:10889-10894, 1997). Other vectors express tumor antigens (e.g. 
melanoma MARTI), proteins that de-regulate the cell cycle and induce apoptosis (p53, pRB, 

30 p2l*^*''*^''", pl6^''^, and even Ad El A), and ribozymes. An Ad vector expressing FasL 
induces apoptosis and tumor regression of a mouse tumor (Arai et al., Proc. Natl Acad. Sci. 
USA P-/;13862-13867, 1997). 

Despite these generally positive reports, it is recognized in the art that 
replication-defective Ad vectors have several characteristics that make them suboptunal for 

35 use in therapy. For example, production of replication-defective vectors requires that they be 
grown n a complementing cell line that provides the El A proteins in trans. Such cell lines 
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are fastidious, and generation of virus stocks is time-consuming and expensive. In addition, 
although many foreign proteins have been expressed from such vectors, the level of 
expression is low c mpared to Ad late proteins. 

To address these problems, several groups have proposed using Feplication- 
S competent Ad vectors for therapeutic use. Replication-competent vectors retain Ad genes 
essential for replication and thus do not require complementing cell lines to replicate. 
Replication-competent Ad vectors lyse cells as a natural part of the life cycle of the vector. 
Another advantage of replication-competent Ad vectors occurs when the vector is engineered 
to encode and express a foreign protein. Such vectors would be expected to greatly amplify 

1 0 synthesis of the encoded protein in vivo as the vector replicates. However, in order to prevent 
RC vectors from damaging normal tissues and causing disseminated viremia, it is important 
that they have some feature that limits their replication to cancer cells. 

Wyeth Laboratories developed replication-competent Ad vectors for vaccination 
purposes, using vaccine strains of Ad serotypes 4, 7, and 5 (Lubeck et al., AIDS Res, Htm, 

1 5 Retroviruses 10: 1443-1449, 1994). Foreign genes were inserted into the E3 region (with the 
E3 genes deleted) or into a site at the right end of the genome. Two foreign genes used were 
hepatitis B surface antigen and the HIV envelope protein. They obtained good expression in 
culture, and were able to raise antisera in animal models. Phase I human trials were 
ambiguous, and the project was mostly abandoned. 

20 Onyx Pharmaceuticals recently reported on adenovinis-based anti-cancer vectors 

which are replication deficient in non-neoplastic cells but which exhibit a replication 
phenotype in neoplastic cells lacking functional p53 and/or retinoblastoma (pRB) tumor 
suppressor proteins (U.S. Patent No. 5,677,178; Heise et al.. Nature Med. (5:639-645, 1997; 
Bischoff et al.. Science 274:373-316^ 1996). This phenotype is reportedly accomplished by 

25 using recombinant adenoviruses containing a mutation in the ElB region that make the 

encoded E1B-55K protein incapable of binding to p53 and/or a mutation(s) in the El A region 
which make the encoded El A protein (p289R or p243R) incapable of binding to pRB and/or 
the cellular 300 kD polypeptide and/or the 1 07 kD polypeptide. E1B-55K has at least two 
independent functions: it binds and inactivates the tumor suppressor protein p53, and it is 

30 required for efficient transport of Ad mRNA from the nucleus. Because these ElB and ElA 
viral proteins are involved in forcing cells into S-phase, which is required for replication of 
adenovirus DNA, and because the p53 and pRB proteins block cell cycle progression, the 
recombinant adenovirus vectors described by Onyx should replicate in celk defective in p53 
and/or pRB, which is the case for many cancer cells, but not in cells with wild-type pS3 

35 and/or pRB. Onyx has reported that replication of an adenovirus lacking E1B-55K, which is 
named ONYX-015, was restricted to p53-minus cancer cell lines (Bischoff et al., supra), and 
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that ONYX-015 slowed the growth or caused regression of a pSS-minus human tumor 
growing in nude mice (Heise et al., supra). Others have challenged the Onyx report claiming 
that replication of ONYX-015 is independent of p53 genotype and occurs efficiently in some 
primary cultured human cells (Harada and Beric, / Virol 7J:5333-5344, 1999). It is now 
5 known that ONYX-015 can replicate in cells with wild-type p53 (Goodrum et al., J. Virol 
72:9479-9490, 1998; Harada et al., J, Virol 75:5333-5344, 1999; Hay et al.. Hum. Gene Ther. 
70:579-590, 1999; Rothmann et al., J. Virol 72:9470-9478, 1998; Tumell et al., /. Virol 
75:2074-2083, 1999). ONYX-015 does not replicate as well as wild-type adenovirus because 
E1B-S5K is not available to facilitate viral mRNA transport from the nucleus. Also, ONYX- 

10 015 expresses less ADP than wild-type virus (see Example 1 below). 

As an extension of the ONYX-015 concept, a replication-competent adenovirus 
vector was designed that has the gene for E1B-55K replaced with the herpes simplex virus 
thymidine kinase gene (Wilder et al., Gene Therapy tf:57-62, 1999). The group that 
constructed this vector reported that the combination of the vector plus gancyclovir showed a 

1 5 therapeutic effect on a human colon cancer in a nude mouse model (Wilder et al.. Cancer Res. 
59:410-413, 1999). However, this vector lacks the gene for ADP, and accordingly, the vector 
will lyse cells and spread from cell-to-cell less efficiently than an equivalent vector that 
expresses ADP. The gene for ADP is also lacking in another replication-competent 
adenovirus vector that has been described, in which a minimal enhancer/promoter of the 

20 human prostate specific antigen was inserted into the adenovirus El A enhancer/promoter 
(Rodriguez etal., Cancer Res. 57:2559-2563, 1997). 

Another strategy for replication-competent vector improvement is to place replication 
under the control of tissue-specific promoters. One group replaced the basal El A promoter 
with a modified promoter for a-fetoprotein (AFP) (Hallenbeck et al.. Hum. Gene Ther. 

25 70:1721-1733, 1999). AFP is expressed in the liver during development, but it is not 
expressed in adults. However, it is expressed in 70-80% of patients with hepatocellular 
carcinoma. Growth of this vector was limited to AFP-expressing cells and the vector showed 
some suppression of xenotransplants. Id. AseriesofRC vectors has also been developed 
that have expression of the El A and ElB genes dependent on the prostate tumor-specific 

30 prostate specific antigen (PSA) and kallikrein promoters/enhancers (Rodriguez et al., Cancer 
Res. 60: 1 196, 1997; Yu et al., Cancer Res.59:4200A203, 2000; Yu et al.. Cancer Res 
59:1498-1504, 1999). 

Thus, there is a continuing need for vectors that replicate and spread efficiently in 
tumors but that can be modified such that they replicate poorly or not at all in normal tissue. 

35 Summary of the Invention 
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Briefly, theref re, the present invention is directed to novel vectors which are 
replication competent in neoplastic cells and which overexpress an adenovirus death protein 
(ADP). The work reported herein demonstrates the discovery that overexpression of ADP by 
a recombinant adenovirus allows the construction of a replication-competent adenovirus that 
5 kills neoplastic cells and spreads from cell-to-cell at a rate similar to or faster than that 
exhibited by adenoviruses expressing wild-type levels of ADP, even when the recombinant 
adenovirus contains a mutation that would otherwise reduce its replication rate in non- 
neoplastic cells. This discovery was unexpected because it could not have been predicted 
from what was known about adenovirus biology that Ad vectors overexpressing ADP remain 

1 0 viable and that the infected cells are not killed by the higher amounts of ADP before the Ad 
vector produces new virus particles that can spread to other tumor cells. Indeed, naturally- 
occurring adenoviruses express ADP in low amounts from the E3 promoter at early stages of 
infection, and begin to make ADP in large amounts only at 24-30 h p.i., once virions have 
been assembled in the cell nucleus. It is believed that other non-adenoviral vectors can be 

1 5 used to deliver ADP's cell-killing activity to neoplastic cells, including other viral vectors and 
plasmid expression vectors. 

Thus, in one preferred embodiment, the ADP-expressing vector comprises a 
recombinant adenovirus lacking expression of at least one E3 protein selected from the group 
consisting of: gpl9K; RIDa (also known as 10.4K); RIDP (also known as 14.SK) and 14.7K. 

20 Becatise these E3 proteins inhibit immune-mediated inflammation and/or apoptosis of Ad- 
infected cells, it is believed that a recombinant adenovirus lacking one or more of these £3 
proteins will stimulate infiltration of inflammatory and inunune cells into a tumor treated with 
the adenovirus and that this host immune response will aid in destruction of the tumor as well 
as tumors that have metastasized. The ADP expressed by preferred embodiments comprises a 

25 naturally-occurring amino acid sequence from a human adenovirus of subgroup C, namely 
Adl, Ad2, Ad5 and Ad6. 

In another embodiment, replication of the vector is restricted to neoplastic cells. Such 
replication-restricted vectors are useful in treating cancer patients in which it is desirable to 
eliminate or reduce damage to normal cells and tissues that might be caused by the vector, 

30 particularly viral vectors that kill the host cell as part of their life cycle. In preferred 

embodiments, a recombinant adenovirus has a replication-restricted phenotype because the 
recombinant adenovirus is incapable of expressing an El A viral protein which binds the pRB 
and the p300/CBP proteins or because the E4 promoter has been substituted with a promoter 
that is activated only in neoplastic cells and/or cells of a specific tissue. 

35 In yet another embodiment, the invention provides a vector which overexpresses ADP 

and whose replication is under the control of a tissue specific promoter, tumor specific 
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promoter or an inducible promoter. In preferred embodiments, the vector comprises a 
recombinant adenovirus in which the tissue specific promoter or inducible promoter is 
substituted for the £4 promoter. Such vectors are useful for restricting replication of the 
vector and its ADP-mediated cell killing to cells of a particular type or to cells exposed to an 
5 exogenous agent that activates the promoter. A preferred tissue-specific or inducible vector 
also expresses a phenotype that restricts its replication to neoplastic cells. 

In yet another embodiment, the invention provides a vector which overexpresses ADP 
but which is not restricted to tumors by a specific genetic modification. Such a vector is more 
destructive to neoplastic cells than even the naturally occurring Ad*s of subgroup C. In 

1 0 preferred embodiments, this vector could be used for patients with terminal cancer not 

treatable by another method, and who have pre-existing neutralizmg antibodies to Ad or to 
which neutralizing antibodies can be administered 

In still another embodiment, the invention provides a composition comprising a first 
recombinant virus which is replication competent in a neoplastic cell and overexpresses the 

15 adenovirus death protein. In one embodiment, the recombinant virus is contained within a 
delivery vehicle comprising a targeting moiety that limits delivery of the virus to cells of a 
certain type. With this embodiment, the replication-competent vector can be of any ADP- 
overexpressing configuration described herein. In some embodiments, the composition also 
comprises a second recombinant virus which is replication-defective and which expresses an 

20 anti-cancer gene product. In some embodiments, the replication-defective vector may be 
engineered to overexpress ADP when replication of this vector is complemented by a 
replication-competent vector. The recombinant virus complements spread of the replication- 
defective virus, as well as its encoded anti-cancer product, throughout a tumor. In prefenred 
embodiments, the first recombinant virus is a recombinant adenovirus whose replication is 

25 restricted to neoplastic cells and/or which lacks expression of one or more of the £3 gpl9K; 
RIDa; RID(3; and 14.7K proteins. 

In additional embodiments, the invention provides replication-competent vectors that 
over&cpresses an ADP and also expresses an anti-cancer product. As with previous 
embodiments, the vector can be of any ADP-overexpressing configuration provided herein. 

30 Preferably, replication of the virus is engineered to (a) be restricted to neoplastic cells, e.g., by 
replacing the E4 promoter with a tissue specific or tumor specific promoter and/or (b) lack 
expression of one or more of the E3 gpl9K; RIDa; RIDP; and 14.7K proteins. In some 
embodiments, the anti-cancer product is inserted into the E3 region. 

The ADP-expressing vectors and compositions of the invention are useful in a 

35 method for promoting death of a neoplastic cell. The method comprises contacting the 

neoplastic cell with a vector which is replication-competent in the neoplastic cell and which 



wo 01/04282 



PCT/USOO/18971 



8 

overexpresses ADP. Where the neoplastic cell comprises a tumor in a patient, the vector is 
administered directly to the tumor or, in other embodiments, the vector is administered to the 
patient systemically or in a delivery vehicle containing a targeting moiety that directs delivery 
of the vector to the tumor. In embodiments where the vector is a recombinant virus, the 
S method can also comprise passively inmiunizing the patient against the virus. 

In yet another embodiment of fhe invention, the vector may be used in combination 
with radiation therapy. The radiation therapy can be any form of radiation dierapy used in the 
art such as for example, external beam radiation such as x-ray treatment, radiation delivered 
by insertion of radioactive materials within the body near or at the tumor site such as 
1 0 treatment with ganrnia ray emitting radionuclides, particle beam therapy which utilizes 

neutrons or charged particles and the like. In addition, this embodiment encompasses the use 
of more than one of the vectors of the present invention in a cocktail in combination with 
radiation therapy. 

Another embodiment of the invention involves the use of the recombinant vector in 

1 5 combination with chemotherapy as has been disclosed for other adenovirus vectors (U.S. 
Patent No. 5,846,945). Chemotheraputic agents are known in the art and include 
antimetabolites including pyrimidine-analogue and purinc-analogue antimetabolites, plant 
alkaloids, antitumor antibiotics, alkylating agents and the like. The use of more than one of 
the vectors of the present invention with a chemotheraputic agent or agents is also 

20 contemplated within this embodiment. 

Among the several advantages found to be achieved by the present invention, 
therefore, may be noted the provision of replication-competent vectors, particularly viruses, 
which rapidly kill cancer cells and spread from cell-to-cell in a tumor, the provision of such 
vectors whose replication can be induced or which is restricted to tumors and/or to cells of a 

25 certain tissue type; and the provision of compositions and methods for anti-cancer therapy 
which cause little to no side effects in normal tissues. 
Brief Description of the Drawings 

Figure 1 is a schematic of gene expression in AdS (Fig. 1 A) and KD3, a preferred 
embodiment of the invention (Fig. IB), in which the respective genomes are represented by 

30 the stippled bars and transcription units represented by arrows above and below the bars» with 
the E3 proteins listed above the arrows for the E3 transcription unit, and the LI to L5 families 
of late mRNA's indicated. 

Figure 2 illustrates the overexpression of ADP by KDl, KD3 , GZi, and GZ3 
showing an immunbblot of proteins isolated from human A549 cells infected with the 

35 indicated viruses and probed with an anti-ADP antibody, with ADP indicating differently 
glycosylated and proteolytically processed forms of ADP. 



wo 01/04282 



PCT/USOO/18971 



Figure 3 illustrates that the El A dl\ 101/1 107 mutation referred to in the figure and 
hereinafter as J/01/07, retards expression of late proteins, showing an immunoblot of El A 
proteins and late proteins in A549 cells infected with the indicated viruses in the absence 
(Figs. 3 A and 3B) or presence (Figs. 3C and 3D) of J/327, which has a wild-type El A region 
5 and has a deletion of all E3 genes but the gene encoding the 12.5K protein (Figs. 3C and 3D). 
An antiserum specific to the El A proteins was used for Fig. 3 A and 3C. An antiserum raised 
against AdS virions was used for Figs. 3B and 3D. 

Figure 4 illustrates that KDl and KD3 kill cells more efficiently than control viruses 
that express less or no ADP, showing a graph of the percent of A549 cells infected with the 
1 0 indicated viruses that were viable at the indicated days p.i. as determined by trypan blue 
exclusion. 

Figure 5 is a cell spread assay illustrating that overexpression of ADP enhances 
spread of virus from cell to cell, showing monolayers infected with the indicated viruses at the 
indicated PFU/cell which were treated at 7 days p.i. with crystal violet, which stains live cells 

IS but not dead cells. 

Figure 6 illustrates that KDl and KD3 replicate well in growing cells but not in 
growth-arrested cells showing the virus titer extracted from growing or growth arrested HEL- 
229 cells at various times following infection with 100 PFU/ml of the following viruses: 
rf/309 (Fig. 6A), J/01/07 (fig. 6B), KDl (Fig. 6C) and KD3 (Fig 6D). 

20 Figure 7 illustrates that KDl and KD3 are defective in killing primary human 

bronchial epithelial cells showing these cell monolayers infected at 30% confluency with 10 
PFU/ml of the indicated viruses and stained at 5 days p.i, with neutral red. 

Figure 8 illustrates that KDl and KD3 reduce the growth rate of hiunan A549 cell 
tumors growing in nude mice, showing in Fig. 8 A a graph of average-fold increase in tumor 

25 size plotted against the number of weeks following infection of the tumor with buffer or with 
5x10' PFU at weekly intervals of or the indicated viruses, and showing in Fig. 8B a similar 
graph of tumors injected once with 5 x 10^ PFU of KD3 or GZ3. 

Figure 9 illustrates that KDl and KD3 reduce the growth rate of human Hep3B cell 
tumors growing in nude mice, showing a gr^h of average-fold increase in tumor size plotted 

30 against the number of weeks following injection of the tumor with buffer or with 5 x lO' PFU 
of rfO09, KDl or KD3 at twice weekly intervals of the indicated viruses. 

Figure 10 illustrates that KDl and KD3 complement the replication and spread of Ad- 
^gal, a replication-defective vector that expresses P-galactosidase, using an infectious center 
assay showing in Fig. lOA a picture of A549 cell monolayers seeded with A549 cells infected 

35 with Ad-|3--gal alone or with the indicated viruses, with Figs lOB and IOC showing close-up 
views of two of the monolayers of Fig. lOA. 
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Figure 1 1 is a bar graph illustrating that KDl and KD3 increase the expression of 
luciferase in human Hep3B cell tumors growing in nude mice, using an assay in which tumors 
were injected with the indicated combinations of viruses, then were extracted 2 weeks p.i. and 
assayed for luciferase activity. The numbers in parentheses indicated the fold increase in 
5 luciferase activity compared to that of the Adluc vector plus buffer. 

Figure 12 is a graph showing the results of a standard plaque development assay for 
KDl and KDl-SPB on A549 cells engineered to express the TTFl transcription factor 
(A549/TTP1) and the parental 549 cells, in which data are plotted as the number of plaques 
observed on a particular day in the assay divided by the final number of plaques observed for 
1 0 that virus multiplied by 100. 

Figure 13 is a cell spread assay for KDl and KDl-SPB on H441 cells and HepBB 
cells, where cells were infected with the indicated amounts of KDl or KDl-SPB and H441 
cells and HepSB cells were strained with crystal violet at 5 days p.i. and 8 days p.i., 
respectively. 

1 5 Figure 1 4 is a graph showing the results of a standard plaque development assay for 

rf/309 and two preferred embodiments of the invention, GZl and GZ3, in which data are 
plotted as the number of plaques observed on a particular day in the assay divided by the final 
number of plaques observed for that virus multiplied by 100. 

Figure 15 is a cell spread assay illustrating that the combination of KDl, KD3, GZl, 

20 or GZ3 with x-ray radiation is more effective in destroying A549 cell monolayers than is 

virus vector alone or radiation alone, wherein cells were infected with the indicated amounts 
of the indicated viruses, radiated with 600 centigreys (cGy) of x-radiation (bottom panel), or 
mock radiated (top panel), then stained with crystal violet at 6 days p.i. 

Figure 16 is a graph of a cell spread assay illustrating that 10'^ PFU of KDl, KD3, 

25 GZl, or GZ3 used in combination with 150, 300, or 600 centigreys of radiation is more 

effective in destroying A549 cell monolayers than virus vector alone or radiation alone. Cell 
viability is based on the amount of crystal violet extracted from the culture wells, using the 
mock-infected non-radiated well as 100% viability. 

Figure 17 illustrates that the combination of KD3 or GZ3 plus x-ray radiation is more 

30 effective in reducing the growth of A549 cell tumors growing in nude mice than KD3 alone or 
GZ3 alone. 

Figtire 18 illustrates a structure-function analysis of ADP, showing in Fig. 18A the 
amino acid sequence of the adenovirus death protein encoded by Ad2, with the various 
putative domains and glycosylation sites labeled and showing in Fig. 18B a schematic of the 
35 ADP gene in reclOO and in the indicated deletion mutants, with the right column 
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summarizing the death promoting phenotype of the various mutants as a percentage of the 
wild-type phenotype. 

Figxires 19A and 19B illustrate a cell viability assay of the indicated ADP mutants 
showing a graph of viability as determined by trypan blue exclusion plotted against hours 
5 (Fig. 19 A) or days (Fig. 19B) postinfection. 

Figure 20 depicts the amino acid sequence, shown in single letter code, for the ADP 
proteins of Adl, Ad2, Ad5, and Ad6 (SEQ ID NOS:5-8), for the Ad2 ADP mutants d/716, 
dniSy rf/714, and dn37 (SEQ ID N0S:9-12), and for the putative lumenal domain (SEQ ID 
NO: 17), the transmembrane domain (SEQ ID NO: 18), the cytosolic basic-proline domain 
1 0 (SEQ ID NO: 1 9), and the remainder of the cystosolic domain (SEQ ID NO:20) of the ADP 
protein of Ad2. 

Figure 21 presents the complete nucleotide sequence of the genome of Ad5. 

Figure 22 presents the complete nucleotide sequence of the genome of KDl (SEQ ED 

NO:l). 

15 Figure 23 presents the complete nucleotide sequence of the genome of KD3 (SEQ ID 

NO:2). 

Figure 24 is a schematic ofthe following vectors: A. Ad5, The stippled bar 
indicates the DNA genome of 36 kbp. The open arrow indicates the immediate early El A 
transcription unit, and the black arrows are the delayed early ElB, E2, E3, and E4 

20 transcription units. The hatched arrows indicate the five families of major late mRNAs, and 
also the ADP mRNA, which is synthesized as part of the major late transcription unit. Each 
major late mRNA has a tripartite leader (leaders 1, 2, and 3) spliced to its 5' terminus. B. 
dl309. dl309 is identical to Ad5 except it has the E3-RID and E3-14.7K genes deleted. dl309 
expresses ADP at levels similar to Ad5. C. KDl. KDl has two small deletions (indicated by 

25 "X" marks) in the El A gene that abolish binding ofthe El A proteins to pRB or p300/CBP. It 
lacks all E3 genes except adp. ADP is expressed earlier in infection and in greater abundance 
than is ADP from Ad5 or dl309 Doronin et al., J. Virol. 74:6141-6155. D. KDl-SPB. KDl- 
SPB is identical to KD 1 , except it has the E4 promoter replaced by the promoter for 
Surfactant Protein B (SPB-P). 

30 Figure 25 presents graphs illustrating that KDl-SPB grows as well as KDl in H441 

Iimg carcinoma cells but much more poorly than KDl in Hep 3B hepatoma cells. CsCl- 
banded stocks of KDl-SPB and KDl were titered using standard methods (ToUe&on et al., p. 
1-9 In W.S.M. Wold (ed.), Adenovirus Methods and Protocols. Humana Press, Inc., Totowa, 
NJ, 1998) on 293-E4 or 293 cells (A), or on A549 cells (B). The data are plotted as the 

3 5 number of plaques seen on any day of the plaque assay as a percentage of the number of 
plaques seen on the final day ofthe assay (ToUefeon et al. Virology 220:152-162, 1996). 
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Figure 26 presents micrographs illustrating that KDl-SPB induces CPE in H441 cells 
but not Hep 3B cells. H441 and Hep 3B mon layers were mock-infected or infected with 10 
PFU/cell of KDl or KDl-SPB, then photographed under phase contrast at 4 or 7 days p.i. 

Figure 27 depicts Southern hybridizations and a graph illustrating that KDl-SPB 
S DNA is synthesized efficiently in H441 but not Hep 3B cells. H441 or Hep 3B cells were 
infected with 10 PFU/cell of KDl or KDl-SPB. Total genomic DNA was isolated at 0, 5, 24, 
48, 72, and 96 h p.i., digested with Hindlll, resolved by agarose gel electrophoresis, blotted, 
and hybridized with "P-labeled Ad DNA. A. Autoradiogram. B. Phosphorlmager 
quantitation of the DNA bands in Panel A. 
1 0 Figure 28 presents graphs depicting single step growth curves showing that KDl-SPB 

grows well in H441 but not Hep 3B cells. Cells were infected with 10 PFU/cell of KDl or 
KDl-SPB. Vectors were extracted at the indicated days p.i. and titers determined by plaque 
assay. 

Figure 29 depicts immunoblots showing that KDl-SPB expresses E40RF3 and ADP 

15 in H441 but not Hep 3B cells. Cells were infected with 10 PFU/cell of KDl or KDl-SPB. At 
24 h p.i., protein extracts were analyzed for El A, E40RF3, and ADP using specific antisera. 
The El A proteins appear as multiple bands. ADP appears as two bands; the upper band is 
glycosylated and the lower band is a proteolytically cleaved species (Scaria et aL, Virology 
797:743-753, 1992; Tollefson et al., J, Virol. 66:2633-3642). 

20 Figure 30 depicts immunofluorescence micrographs showing that KDl-SPB 

expresses E40RF3 in H441 but not Hep 3B cells. Cells growing on coverslips were infected 
with 20 PFU/cell of KDl, KDl-SPB, or dl309 (wild-type). At 48 h (Panel A) or 6 days 
(Panel B), cells were fixed and stained with a rabbit polyclonal antipeptide antiserum against 
E40RF3. Photographs were taken using a lOOX Planapo lens. Each panel shows about 8 

25 nuclei. This figure is part of the same experiment shown in Figure 3 1 . 

Figure 3 1 depicts immunofluorescence micrographs showing that KDl-SPB does not 
express E2-DBP or fiber efficiently in Hep 3B cells. Hep 3B cells were infected with 20 
PFU/cell of KDl-SPB or KDl. At 48 h (A) or 6 days (B) p.i., cells were fixed and double- 
stained using a rabbit polyclonal antiserum against DBP and a mouse monoclonal antibody 

30 against fiber. The same fields are shown for DBP and fiber. This figure is part of the same 
experiment shown in Figure 30. 

Figure 32 presents graphs illustrating that KDl-SPB lyses H441 but not Hep 3B as 
efficiently as KDl. H441 or Hep 3B cells were mock-infected or infected with 20 PFU/cell 
of KDl or KDl-SPB. Cell lysis was determined by release of lactate dehydrogenase from the 

35 cells into the medium. 
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Figure 33 presents graphs illustrating that KDl-SPB suppresses growth of H441 
tumors in nude mice equally as well as KDl . Tumor cells were injected into flanks of nude 
mice and allowed to grow to about 100 fil (H441) or 150 ^! (Hep 3B) volumes. Tumors (n - 
10) were injected with DMEM (mock) or with 5 x lO' PFU of KDl or KDl-SPB. Injections 
5 of the viruses were repeated twice weekly for 3 weeks to a total dose of 3.0 x 10* PFU per 
tumor. Tumors were measured and the mean fold-increase in tumor size was calculated. 
Description of the Preferred Embodiments 

In accordance with the present invention, it has been discovered that overexpression 
of ADP by a recombinant adenovirus results in faster lysis of cells and spread of the virus 
1 0 throughout a cell monolayer than viruses expressing wild-type levels of ADP. It has also 
been discovered that this function for ADP is manifest in an adenovirus that contains El A 
mutations that restrict adenoviral replication to neoplastic cells. Thus, vectors which are both 
replication competent in neoplastic cells and which overexpress ADP should be useful in anti- 
cancer therapy. 

IS In the context of this disclosure, the following terms will be defined as follows unless 

otherwise indicated: 

"Naturally-occurring" as applied to an object such as a polynucleotide, polypeptide, 
or virus means that the object can be isolated from a source in nature and has not been 
intentionally modified by a human. 

20 "Neoplastic cell" means a cell which exhibits an aberrant growth phenotype 

characterized by a significant loss of control of cell proliferation and includes actively 
replicating cells as well as cells in a temporary non-replicative resting state (Gi or G2). A 
neoplastic cell may have a well-differentiated phenotype or a poorly-differentiated phenotype 
and may comprise a benign neoplasm or a malignant neoplasm. 

25 "Recombinant virus" means any viral genome or virion that is different than a wild- 

type virus due to a deletion, insertion, or substitution of one or more nucleotides in the wild- 
type viral genome. The recombinant virus can have changes in the nimiber of amino acid 
sequences encoded and expressed or in the amount or activity of proteins expressed by the 
virus. In particular, the term includes recombinant viruses generated by the intervention of a 

30 human. 

"Replication-competent" as applied to a vector means that the vector is capable of 
replicating in normal and/or neoplastic cells. As applied to a recombinant virus, "replication- 
competent" means that the virus exhibits the following phenotypic characteristics in normal 
and/or neoplastic cells: cell infection; replication of the viral genome; and production and 
35 release of new virus particles; although one or more of these characteristics need not occur at 
the same rate as they occur in the same cell type infected by a wild-type virus, and may occur 
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at a faster or slower rate. Where the recombinant virus is derived from a virus such as 
adenovirus that lyses the cell as part of its life cycle, it is preferred that at least 5 to 25% of the 
cells in a cell culture monolayer are dead 5 days after infection. Preferably, a replication- 
competent virus infects and lyses at least 25 to 50%, more preferably at least 75%, and most 
preferably at least 90% of the cells of the monolayer by 5 days post infection (p.i.). 

"Replication-defective" as applied to a recombinant virus means the virus is incapable 
of, or is greatly compromised in, replicating its genome in any cell type in the absence of a 
complementing replication-competent virus. Exceptions to this are cell lines such as 293 cells 
that have been engineered to express adenovirus El A and ElB proteins. 

"Replication-restricted** as applied to a vector of the invention means the vector 
replicates better in a dividing cell, i.e. either a neoplastic cell or a non-neoplastic, dividing 
cell, than in a cell of the same type that is not neoplastic and/or not dividing, which is also 
referenced herein as a normal, non-dividing cell. Preferably, a replication-restricted virus 
kills at least 10% more neoplastic cells than normal, non-dividing cells in cell culture 
monolayers of the same size, as measured by the number of cells showing cytopathic effects 
(CPE) at 5 days p.i. More preferably, between 25% and 50%, and even more preferably, 
between 50% and 75% more neoplastic than normal cells are killed by a replication-restricted 
virus. Most preferably, a replication-restricted adenovirus kills between 75% and 100% more 
neoplastic than normal cells in equal sized monolayers by 5 days p.i. 

In one embodiment the invention provides a vector that is replication-competent in 
neoplastic cells and which overexpresses an ADP. Vectors useful in the invention include but 
are not limited to plasmid-expression vectors, bacterial vectors such as Salmonella species 
that are able to invade and survive in a number of different cell types, vectors derived from 
DNA viruses such as human and non-human adenoviruses, adenovirus associated viruses 
(AAVs), poxviruses, herpesviruses, and vectors derived from RNA viruses such as 
retroviruses and alphaviruses. Preferred vectors include recombinant viruses engineered to 
ovcrexpress an ADP. Recombinant adenoviruses are particularly preferred for use as the 
vector, especially vectors derived from Adl, Ad2, Ad5 or Ad6. 

Vectors according to the invention overexpress ADP. As applied to recombinant Ad 
and AAV vectors, the term *'overexpresses ADP** means that more ADP molecules are made 
per viral genome present in a dividing cell infected by the vector than expressed by any 
previously known recombinant adenoviral vector or AAV in a dividing cell of the same type. 
As applied to other, non-adenoviral vectors, **oveiexpresses ADP** means that the virus 
expresses sufficient ADP to l)rse a cell containing the vector. 

Vectors overexpressing ADP can be prepared using routine methodology. See, e.g., 
A Laboratory Cloning Manual^ 2nd Ed., v 1. 3, Sambrook et al., eds.. Cold Spring Harbor 
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Laboratory Press, 1989. For example, a polynucleotide encoding the AD? can be cloned into 
a plasmid expression vector known to efficiently express heterologous proteins in mammalian 
cells. The polynucleotide should also include appropriate termination and polyadenyiation 
signals. Enhancer elements may also be added to the plasmid to increase the amount of ADP 
S expression. Viral vectors overexpressing ADP can be prepared using similar materials and 
techniques. 

Where the virus is a recombinant adenovirus, overexpression of ADP can be achieved 
in a multitude of ways. In general, any type of deletion in the E3 region that removes a splice 
site for any of the E3 mRNAs will lead to overexpression of the mRNA for ADP, inasmuch 

1 0 as more of the E3 pre-mRNA molecules will be processed into the mRNA for ADP. This is 
exemplified in the KDl, KD3, GZl and GZ3 vectors (SEQ ID N0S:l-4) whose construction 
is described below. Other means of achieving overexpression of ADP in Ad vectors include, 
but are not limited to: insertion of pre-mRNA splicing and cleavage/polyadenylation signals 
at sites flanking the gene for ADP; expression of ADP fiDm another promoter, e.g. the human 

15 cytomegalovirus promoter, inserted into a variety of sites in the Ad genome; and insertion of 
the gene for ADP behind the gene for another Ad mRNA, together with a sequence on the 5' 
side of the ADP sequence that allows for internal initiation of translation of ADP, e.g. the Ad 
tripartite leader or a viral internal ribosome initiation sequence. 

The ADP expressed by a vector according to the invention is any polypeptide 

20 comprising a naturally-occurring full-length ADP amino acid sequence or variant thereof that 
confers upon a vector expressing the ADP the ability to lyse a cell containing the vector such 
that replicated copies of the vector are released from the infected cell. A preferred full-length 
ADP comprises the ADP amino acid sequence encoded by Adl, Ad2, Ad5 or Ad6. These 
naturally-occumng ADP sequences are set forth in SEQ ID NOS:5-8, respectively. ADP 

25 variants include fragments and deletion mutants of naturally-occurring adenovirus death 
proteins, as well as full-length molecules, fragments and deletion mutants containing 
conservative amino acid substitutions, provided that such variants retain the ability, when 
expressed by a vector inside a cell, to lyse the cell. 

Conservative amino acid substitutions refer to the interchangeability of residues 

30 having similar side chains. Conservatively substituted amino acids can be grouped according 
to the chemical properties of their side chains. For example, one grouping of amino acids 
includes those amino acids having neutral and hydrophobic side chains (A, V, L, I, P, W, F, 
and M); another groining is those amino acids having neutral and polar side chains (G, S» T, 
Y, C, N, and Q); another grouping is those amino acids having basic side chains (K, R, and 

35 H); another grouping is those amin acids having acidic side chains (D and E); another 
grouping is those amino acids having aliphatic side chains (G, A, V, L, and I); another 
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grouping is those amino acids having aliphatic-hydroxyl side chains (S and T); another 
gr uping is those amino acids having amine-containing side chains (N, Q, K, and H); 
another grouping is those amino acids having aromatic side chains (F, Y, and W); and another 
grouping is those amino acids having sulfur-containing side chains (C and M). Preferred 
5 conservative amino acid substitutions groups are: R-K; E-D, Y-F, L-M; V-1, and Q-H. 

As used herein, an ADP variant can also include modifications of a naturally- 
occurring ADP in which one or more amino acids have been inserted, deleted or replaced with 
a different amino acid or a modified or unusual amino acid, as well as modifications such as 
glycosylation or phosphorylation of one or more amino acids so long as the ADP variant 

1 0 containing the modified sequence retains cell lysing activity. 

As described below, the inventors herein performed a structure-function analysis of 
ADP that defined specific domains in ADP required to promote cell death. Using this 
information, when combined with known recombinant DNA and cloning methodology, it is 
believed the skilled artisan can readily construct ADP variants of a naturally-occurring 

15 adenovirus death protein and test them for cell lysing activity. A preferred ADP deletion 
mutant comprises an ADP amino acid sequence from any of the deletion mutants Jr716, 
rf/715, dni4 and rf/737, whose ADP sequences are set forth in SEQ ID NOS:9-l2, 
respectively). 

Where the vector is derived from a virus, it is preferred that the virus lack expression 

20 of one or more viral proteins involved in avoiding host anti-viral defenses such as immune- 
mediated inflanmiation and/or apoptosis of infected cells. For example, adenovirus contains a 
cassette of genes that prevents killing of Ad- infected cells by the immune system (Wold et al., 
Semin. Virol, 1998 (8:515-523, 1998). The E3-I4.7K protein and the E3 RID (Receptor 
Internalization and Degradation) protein, which is a complex consisting of RIDa and RDOP, 

25 inhibit apoptosis of Ad-infected cells induced by tumor necrosis factor (TNF) and the Fas 
ligand which are expressed on, or secreted by, activated macrophages, natural killer (NK) 
cells, and cytotoxic lymphocytes (CTLs) (Tollefson et al., Nature 392:727-730, 1998). The 
E3-gpl9K protein inhibits CTL-killing of infected cells by blocking transport of MHC class I 
antigens to the cell surface (Wold et al., supra). Thus, it is believed that infection of tumor 

30 cells by such viral vectors will stimulate infiltration of inflammatory cells and Ijonphocytes 
into the tumor, and will not prevent infected tumor cells from apoptosis induced by cytolytic 
cells of the immime system, or against apoptosis inducing cytokines. For example, it is 
known that when mice are infected with Ad mutants lacking the E3 gpl9K, RID and 14.7K 
proteins there is a dramatic increase (as compared to £3-positive Ad) in infiltration of 

35 inflammatory cells and lymphocytes into the infected tissue (Sparer et al., J. Virol 70:2431- 
2439, 1996). A similar infiltration of tumors infected by an ADP-expressing viral vector of 
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the invention would be expected to ftarther promote destruction of the tumor by adding an 
immune system attack to the ADP-mediated kiUing activity. For example, it is believed that 
the viral infection will stimulate formation of tumor-specific CTL's that can kill neoplastic 
cells not only in the tumor but also ones that have metastasized. In addition, it is also 
5 expected that vector-specific CTL's will be generated which could attack vector-infected cells 
if the vector spreads away &om the tumor into normal cells. Because viral vectors 
overexpressing ADP will spread rapidly through the tumor, it is believed these immune 
mechanisms will have little effect on spread of the vector. 

Where the vector is a recombinant adenovirus, it is preferred that the adenovirus lack 
10 expression of each of the E3 gpl9K, RID, and 14.7K proteins. By "lack expression" and 
"lacking expression" of a protein(s), it is meant that the viral genome contains one or more 
mutations that inactivates expression of a functional protein, i.e., one having all the functions 
of the wild-type protein. The inactivating mutation includes but is not limited to substitution 
or deletion of one or more nucleotides in the encoding gene{s) (hat prevents expression of 

1 5 functional transcripts or that results in transcripts encoding nonfunctional translation products. 
A particularly preferred way to inactivate expression of the Ad E3 gpl9K, RID, and 14.7K 
proteins is by deleting the E3 region containing the genes encoding these proteins. 
Preferably, one or both of the E3 genes encoding the E3 6.7K and 12.5K proteins are also 
deleted because, as discussed in the Examples below, it is believed that deletion of most or all 

20 of the E3 genes other than the ADP gene facilitates ovcrexpression of ADP mRNA by 
reducing competition for splicing of the major late pre-mRNAs. Preferred Ad vectors 
containing an E3 deletion that overexpress ADP are GZl (SEQ ID NO:3) and GZ3 (SEQ ID 
NO:4), whose construction and properties are described in the Examples below. 

The invention also provides ADP-expressing vectors whose replication is restricted to 

25 dividing cells. Any means known to provide such a replication-iestricted phenotype may be 
used. For example, WO 96/40238 describes microbes that preferentially invade tumor cells 
as well as methods for identifying and isolating bacterial promoters that are selectively 
activated in tumors. It is also contemplated that expression of one or more vector proteins 
essential for replication can be placed under the control of the promoter for a cellular gene 

30 whose expression is known to be upregulated in neoplastic cells. Examples of such genes 
include but are not limited to: the breast cancer markers mammaglobin (Watson et al., 
Oncogene /d:817-824, 1998); BRCAl (Norris et al.,y. BioL Chem, 270:22777-22782, 1995) 
herl/neu (Scott et al.,y. BioL Chem. 2dP: 19848-19858, 1994); prostate specific antigen (U.S. 
Patent 5,698,443); surfactant protein B for lung alveoli (Yan et al., Jl Biol Chem. 270:24852- 

35 24857, 1995); factor VH for liver (Greenberg et al, Proc. Natl Acad. ScL USA 92:12347- 
12351, 1995); and survivin for cancer in general (Li et al.. Nature iP(f:580-584). Where the 
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vector is an adenoviniSt it is contemplated that such tumor-specific promoters can be 
substituted for the E4 promoter. Because E4 gene products arc essential for Ad replication, 
placing their expression under the control of a tumor-specific promoter should restrict 
replication of the vector to turn r cells in which the promoter is activated. 
5 Another strategy for restricting replication of ADP-cxpressing Ad vectors to 

neoplastic cells is exemplified by the KDl (SEQ ID N0:1), KD2 (SEQ ID N0:13) and KD3 
(SEQ ED NO:2) vectors, whose construction and properties are described in the Examples 
below. This strategy exploits a pre-existing Ad5 mutant in the El A gene, named dl\ 101/1 107 
(Howe et al., Proc. Nail. Acad ScL, ^7:5883-5887, 1990), also referred to herein as rf/01/07, 

10 and which can only grow well in cancer cells. The role of El A is to drive cells firom the Go 
and G| phases of the cell cycle into S-phase. This is achieved by two mechanisms, one 
involving pRB (and family members), and the other involving p300 and the related protein 
CBP (DePinho, RA., Afaft/re i9/:533-536, 1998). One domain in El A binds members of the 
pRB family. pRB normally exists in the cell as a complex with the transcription factor E2F-1 

15 and E2F family members (E2F), tethered via E2F to E2F binding sites in promoters of cells 
expressed in S-phase. Here, pRB acts as a transcriptional co-repressor. El A binding to pRB 
relieves this repression, and causes the release of E2F firom pRB/E2F complexes. Free E2F 
then activates promoters of genes expressed in S-phase, e.g. thymidine kinase, ribonucleotide 
reductase, etc. Another domain in El A binds the p300/CBP transcription adaptor protein 

20 complex. p300/CSP is a transcriptional co-activator that binds many different transcription 
factors and accordingly is targeted to promoters. p300/CBP has intrinsic histone 
acetyltransferase activity. El A binding to p300/CBP is believed to inhibit this histone 
acetyltransferase activity, allowing acetylation of histones and repression of transcription 
(Chakravarti et al.. Cell Pd:393-403, 1999; Hamamori et al., Cell 9^:405-413, 1999). 

25 Conceivably, some of the goies that are repressed as a result of El A interacting with 

p300/CBP to play a role in blocking the cell cycle, although this is not known. Cancer cells 
are cycling, so they have fret E2F and presumably some p300/CBP-regulated genes are 
repressed. Consistent with these ideas. El A must bind both p300/CBP and the pRB family in 
order to transform primary cells to a constitutively cycling state (Howe et ah, supra). The 

30 mutant J/01/07 lacks both the p300/CBP- and pRB-binding domains and, as expected, it 

replicates very poorly in non-dividing "normal" cells or serum-starved cancer cells, but well 
in growing cancer cells. As described below, the growth of the KDl and KD3 vectors, which 
contain the J/01/07 El A mutation, is very much better in dividing cancer cells as compared to 
non-dividing cells. Because the d/01/07 mutant is completely defective in oncogenic 

35 transformation of rat cells (Howe t la., supra\ vectors according to the invention that contain 
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this El A mutation cannot induce cancer in humans (remote as that may be) through an ElA- 
dependent mechanism. 

The invention also includes vectors overexpressing ADP whose replication is 
restricted to specific tissues by placing expression of one or more proteins essential for 
5 replication under the control of a tissue specific promoter and/or a tumor specific promoter. 
A number of tissue-specific and/or tumor specific promoters have been described in the art. 
Non-limiting examples include the surfactant protein B promoter, which is only active in cells 
containing the TTFl transcription factor (i.e., type II alveolar cells (Yan et aL, supra)), as 
described in U.S. Patent 5,466,596 to Breitman et al., which directs gene expression 

1 0 specifically in cells of endothelial lineage; prostate specific antigen which is expressed in 
prostate cells (Rodriguez et al., supra); human telomerase protein (hTERT) promoter (see, 
e.g., U.S. Patent No. 6,054,575); and human alpha-lactalbumin gene which is expressed in 
breast cancer cells (Anderson et al., Gene Therapy 6:854-864, 1999). Many other tissue- 
specific, tumor specific, or tissue-preferred enhancer/promoters have been reported (Miller 

15 and Whelan, Human Gene Therapy 5:803-815, 1997). As exemplified with the surfactant 

protein B promoter in Examples 6 and 10, vectors expressing tissue-specific promoters would 
be expected to show tissue specificity in viral replication, viral spreading, cell lysis, and 
tumor suppression. 

Replication of vectors according to the invention can also be controlled by placing 

20 one or more genes essential for vector replication under the control of a promoter that is 
activated by an exogenous inducing agent, such as metals, hormones, antibiotics, and 
temperature changes. Examples of such inducible promoters include but are not limited to 
metallothionein promoters, the glucocorticoid promoter, the tetracycline response promoter, 
and heat shock protein (hsp) promoters such as the hsp 65 and 70 promoters. 

25 The invention also provides compositions comprising a recombinant vector that 

overexpresses ADP in an amount effective for promoting death of neoplastic cells and a 
method comprising administering a therapeutically effective amount of the vector to a 
neoplastic cell in a patient. It is believed the compositions and methods of the present 
invention are useful for killing neoplastic cells of any origin and include neoplastic cells 

30 comprising tumors as well as metastatic neoplastic cells. 

It is also contemplated that ADP-expressing viral vectors can be administered to 
neoplastic cells along with a replication-defective virus that expresses an anti-cancer gene 
product. For example, many repUcation-defective El' Ad vectors for use in cancer therapy 
are well characterized. A limitation of replication-defective vectors is that they only 

35 synthesize the therapeutic protein in the cell they initially infect, they cannot spread to other 
cells. Also, since the genome does not replicate, transcription can only occur from the input 
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genomes, and this could be as low as one copy per cell. In contrast, the genome of 
replication-competent Ad vectors are amplified by about 10^ in the cell that was initially 
infected, providing more templates for transcription. More amplification is achieved as the 
vector spreads to other cells. By combining replication-defective viral vectors expressing an 
5 anti-cancer gene product with replication-competent viral vectors described herein, it is 
expected that the result will be template amplification and rapid spread of both vectors to 
surrounding cells. For example, with Ad-based vectors, the burst size for each vector should 
be large, -10* PFU/cell, so the probability of co-infection of suirounding cells by both vectors 
will be high. Thus, both the replication-competent and replication-defective vectors should 

1 0 spread simultaneously through the tumor, providing even more effective anti-cancer therapy. 

As an alternative method of delivering an anti-cancer gene product with an ADP 
overexpressing Ad vector, the anti-cancer gene can be engineered into any of the ADP 
ovcrexprcssing replication-competent vectors described herein, in order to provide both the 
ADP and the anti-cancer function in a single vector. The anti-cancer gene can be engineered 

1 5 into any appropriate location of the vector, as can be easily determined by the skilled artisan. 
For example, the anti-cancer gene can be engineered into the E3 region. 

Expression of the anti-cancer gene product encoded by the replication-defective 
vector can be under the control of either constitutive, inducible or cell-type specific 
promoters. The anti-cancer gene product can be any substance that promotes death of a 

20 neoplastic cell. The term "gene product" as used herein refers to any biological product or 
products produced as a result of the biochemical reactions that occur under the control of a 
gene. The gene product can be, for example, an RNA molecule, a peptide, a protein, or a 
product produced under the control of an enzyme or other molecule that is the initial product 
of the gene, i.e., a metabolic product. For example, a gene can first control the synthesis of an 

25 RNA molecule which is translated by the action of ribosomes into a prodrug converting 

enzyme which converts a nontoxic prodrug administered to a cancer patient to a cell-killing 
agent; the RNA molecule, enzyme, and the cell-killing agent generated by the enzyme are all 
gene products as the term is used here. Examples of anti-cancer gene products include but are 
not limited to cell-killing agents such as apoptosis-promoting agents and toxins; prodrug 

30 converting enzymes; angiogenesis inhibitors; and inununoregulatory molecules and antigens 
capable of stimulating an immune response, humoral and/or cellular, against the neoplastic 
cell. 

Apoptosis-promoting agents include but are not limited to the pro-apoptotic members 
of the BCL-2 family such as BAX, BAD, BID and BK, as well as antisense molecules which 
35 block expression of anti-apoptotic members of the BCL-2 family. Examples of 

immunoregulatory molecules are cytokines such as tumor necrosis factor, Fas/Apol/G>9S 
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ligand, tumor necrosis factor related apoptosis inducing ligand, interleukins, macrophage 
activating factor and interferon y. Angiogenesis inhibitors include but arc not limited to 
endostatin and angiostatin. Toxins include but arc not limited to tumor necrosis factor, 
lymphotoxin, the plant toxin ricin, which is not toxic to humans due to the lack of ricin 
5 receptors in animal cells, and the toxic subunit of bacterial toxins. Examples of pro-drug 
converting enzymes and pro-drug combinations are described in WO 96/40238 and include 
thymidine kinase and acyclovir or gancyclovir, and bacterial cytosine deaminase and 5- 
fluorocytosine. 

The therapeutic or pharmaceutical compositions of the present invention can be 
1 0 administered by any suitable route known in the art including for example by direct injection 
into a tumor or by other injection routes such as intravenous, subcutaneous, intramuscular, 
transdermal, intrathecal and intracerebral. Administration can be either rapid as by injection 
or over a period of time as by slow infusion or administration of slow release fonnulation. 
For treating tissues in the central nervous system, administration can be by injection or 

1 5 infusion into the cerebrospinal fluid (CSF). When it is intended that a recombinant vector of 
the invention be administered to cells in the central nervous system, administration can be 
with one or more agents capable of promoting penetration of the vector across the blood-brain 
barrier. Preferably, vectors of the invention are administered with a carrier such as liposomes 
or polymers containing a targeting moiety to limit delivery of the vector to targeted cells. 

20 Examples of targeting moieties include but are not limited to antibodies, ligands or receptors 
to specific cell surface molecules. 

Compositions according to the invention can be employed in the form of 
pharmaceutical preparations. Such preparations are made in a manner well known in the 
pharmaceutical art. One preferred preparation utilizes a vehicle of physiological saline 

25 solution, but it is contemplated that other pharmaceutically acceptable carriers such as 

physiological concentrations of other non-toxic salts, five percent aqueous glucose solution, 
sterile water or the like may also be used. It may also be desirable that a suitable buffer be 
present in the composition. Such solutions can, if desired, be lyophilized and stored in a 
sterile ampoule ready for reconstitution by the addition of sterile water for ready injection. 

30 The primary solvent can be aqueous or alternatively non-aqueous. 

The carrier can also contain other pharmaceutically-acceptable excipients for 
modifying or maintaining the pH, osmolarity, viscosity, clarity, color, sterility, stability, rate 
of dissolution, or odor of the formulation. Similarly, the carrier may contain still other 
pharmaceutically-acceptable excipients for modifying or maintaining release or absorption or 

35 penetration across the blood-brain barrier. Such excipients are those substances usually and 
customarily employed to formulate dosages for parenteral administration in either unit dosage 
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or multi-dose form or for direct inilision into the cerebrospinal fluid by continuous or periodic 
infusion. 

It is also contemplated that certain formulations containing ADP-expressing vectors 
are to be administered orally. Such formulations are preferably encapsulated and formulated 
5 with suitable carriers in solid dosage forms. Some examples of suitable carriers, excipients, 
and diluents include lactose, dextrose, sucrose, sorbitol, mannitol, starches, gum acacia, 
calciimi phosphate, alginates, calcium silicate, microcrystalline cellulose, 
polyvinylpyrrolidone, cellulose, gelatin, syrup, methyl cellulose, methyl- and - 
propylhydroxybenzoatcs, talc, magnesium, stearate, water, mineral oil, and the like. The 

1 0 formulations can additionally include lubricating agents, wetting agents, emulsifying and 
suspending agents, preserving agents, sweetening agents or flavoring agents. The 
compositions may be formulated so as to provide rapid, sustained, or delayed release of the 
active ingredients after administration to the patient by employing procedures well known in 
the art. The formulations can also contain substances that diminish proteolytic degradation 

1 5 and promote absorption such as, for example, surface active agents. 

The specific dose is calculated according to the approximate body weight or body 
surface area of the patient or the volume of body space to be occupied. The dose will also be 
calculated dependent upon the particular route of administration selected. Further refinement 
of the calculations necessary to determine the appropriate dosage for treatment is routinely 

20 made by those of ordinary skill in the art. Such calculations can be made without undue 

experimentation by one skilled in the art. Exact dosages are determined in conjunction with 
standard dose-response studies. It will be understood that the amount of the composition 
actually administered will be determined by a practitioner, in the light of the relevant 
circumstances including the condition or conditions to be treated, the choice of composition to 

25 be administered, the age, weight, and response of the individual patient, the severity of the 
patient's symptoms, and the chosen route of administration. Dose administration can be 
repeated depending upon the pharmacokinetic parameters of the dosage formulation and the 
route of administration used. 

The invention also contemplates passively immunizing patients who have been 

30 treated with a viral vector ovcrexpressing ADP. Passive immunization can include 

administering to the patient antiserum raised against the viral vector, or gamma-globulin or 
vector*specific purified polyclonal or monoclonal antibodies isolated firom the antiserum. 
Preferably, the patient is passively inununized after a time period sufficient for the viral 
vector to replicate in and spread through the tumor. 

35 Preferred embodiments of the invention are described in the following examples. 

Other embodiments within the scope of the claims herein will be apparent to one skilled in the 
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art from consideration of the specification or practice of the invention as disclosed herein. It 
is intended that the specification, together with the examples, be considered exemplary only, 
with the scope and spirit of the invention being indicated by tiie claims which follow the 
examples. 

5 Example 1 

This example illustrates the constniction and characterization of the KDl and KD3 
anti-cancer vectors. 

To construct KDl, the inventors deleted the entire E3 region of a unique plasmid, 
leaving behind only a unique PacI site for cloning. The starting plasmid was pCRII, 

1 0 purchased from Invitrogen, containing the AdS BamHIA fragment having a deletion of all the 
E3 genes; the E3 deletion is identical to that for KDl and GZ3, the sequences of which are 
given in SEQ ID NO: 1 and SEQ ID N0:4, respectively. The ADP gene from AdS was cloned 
into the PacI site, then built into the E3 region of the genome of the AdS El A mutant named 
dlQVQl, This was done by co-transfecting into human embryonic kidney 293 cells the 

15 aforementioned BamHIA fragment containing the ADP gene together with the overlapping 
EcoRIA restriction fragment obtained from dlOMQl. Complete viral genomes are formed 
within the cell by overlap recombination between the Ad sequences in the BamHIA fragment 
in the plasmid and the EcoRIA fragment. KD3 was constructed in the same way except the 
E3 gene for the 12*SK protein was retained in the starting plasmid. A vector named KD2, 

20 which marginally overexpress ADP, was also prepared. Plaques of each recombinant Ad 
were picked, screened, purified, expanded into CsCl-banded stocks, sequenced, titered, and 
characterized. GZl and GZ3 are Ad vectors that are identical to KDl and KD3, respectively, 
except that GZl and GZ3 have wild-type El A sequences as found in ADS or in the AdS 
mutant <f/309. GZl and GZ3 were constructed as described for KDl and KD3 except that the 

2S EcoRIA fragment of AdS was used for GZl and GZ3. 

KDl and KD3 were characterized in cell culture by infecting the human AS49 lung 
carcinoma cell line with high titer (1-8 x 10'° plaque forming units [PFXJ] per ml) virus stocks 
of one of these recombinant vectors, or with one of the control viruses dl0\/07, <//309, £//327, 
and AdS (wt). Fifty PFU per cell were used for each virus. The descriptions of these viruses 

30 as well as some other viruses used in these examples are presented in Table 1 . 
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Using a pol)anerase chain reaction (PCR)-based protocol, an in-frame stop codon was 
introduced into the gene for the E3-gpl9K protein in the E3 region of the Ad5 mutant dl309 
(Jones and Shenk, Cell 77:683-689, 1979). The mutagenesis was conducted using a Sunl- 
Bstl 1071 fragment, nucleotides 28,390 to 29,012 in the Ad5 genome, which was then 
5 substituted for the equivalent fragment in rf/309. rf/01/07 is the parent for KDl and KD3. In 
turn, the Ad5 mutant named rf/309 is the parent of rf/01/07, i.e. rf/309 is identical to rf/01/07 
except that ^/309 does not have the El A mutation. Both dlOV07 and dl309 have deletions of 
the genes for the E3 RIDo, RIDp and 14.7K proteins but retain the gene for ADP. The Ad5 
mutant dl327 has wild-type El A, it lacks the gene for ADP, and its lacks all other E3 genes 

1 0 except the one for the 1 2.5K protein. 

At 24 and 36 hours post*infection (h p.i.), proteins were extracted from the AS49 cells 
and analyzed for ADP by immunoblot using a rabbit antiserum against ADP (ToUefson et aL, 
J. ViroL 66:3633-3642, 1992). The results are shown in Figure 2. Much more ADP was 
detected at 24 and 36 h p.i. in KDl - and KD3-infected cells than in cells infected with 

1 5 rf/01/07. Also, much more ADP was synthesized by GZl and GZ3 than rf/309 or the other 
viruses. Most importantly, KDl, KD3, GZl, and GZ3 expressed much more ADP at 24 h p.i. 
than did dlOl/Ol or dl309 (Fig. 2). This result is consistent with an observation discussed 
below that the cells infected with KDl , KD3, GZl, or GZ3 lyse faster, and that these viruses 
spread from cell to cell faster than rf/01/07 or rf/309. It is noteworthy that KDl, KD3, GZl, 

20 and GZ3 express much more ADP at 24 and 36 h p.i. than the Ad5 mutant <//1520 (Fig. 2); 
J/1520 is the original name given to ONYX-015 (Heise et al. Nature Medicine 3:639-645, 
1997). As expected, no ADP was detected in ceils infected with pm734A (Fig. 2), a mutant 
that lacks amino acids 1 to 48 in ADP (Tollefson et al., J. Virol 70:2296-2306, 1996). 
Expression of the El A proteins by rf/01/07, KDl, KD2, and KD3 was slightly less than by 

25 Ad5, rf/309, or <//327, and as expected from the dlOl/07 deletion, the proteins were smaller 
(Fig. 3A). J/327 is isogenic with J/324 (Thimmappaya et al., 1982 Cell 37:543-5 1, 1983), 
and it lacks the gene for ADP and all other E3 proteins except the 12.5K protein. 

The amount of ADP detected in the KDl and KD3 infected cells is significantly 
higher than the amount detected in the J7309 infected cells (Fig. 2). If one takes into 

30 consideration the fact that the viruses with the El A mutation replicate somewhat slower, as 
evidenced in by the delayed appearance of the late proteins (Fig. 3B), it is clear that KDl and 
KD3 express much more ADP per viral genome present in the cell than dl309. This finding is 
supported by the fact that when A549 cells are coinfected with a virus containing the El A 
mutation and J/327, which lacks ADP but has wild-type El A, the replication rates of the El A 

35 mutant viruses speed up, as indicated by earlier appearance f late proteins (compare Figs. 3B 
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and 3D). Thus, ^//3 27 complements the El A mutation. In conclusion, these experiments 
demonstrate that ADP is dramatically overexpressed by KDI, KD3, GZl, and GZ3. ADP is 
marginally overexpressed by KD2 (not shown). 

Example 2 

5 This example illustrates that KDI and KD3 lyse cells more rapidly and spread from 

cell-to cell faster than other adenoviruses. 

The ability of KDI and KD3 to lyse cells was examined by a trypan blue exclusion 
cell viability assay which was performed essentially as described by Tollefson ct al., 7. ViroL 
70:2296-2306. 1996. In brief, A549 cells were mock-infected or infected with 20 PFU/cell of 
10 KDI, BCD3, rf/01/07, <//327 or rf/309. At various days p.i., the number of viable cells was 
determined using a hemocytometer (600 to 1000 cells were counted per time point) and the 
results are shown in Fig. 4. 

Only 25% of the KDI -infected cells and 9% of the KD3'infected cells were alive at 5 
days p.i. as compared to 44% of cells infected with rf/01/07, which has the same El A 

IS mutation as KDI and KD3. The KDI and KD3 vectors also lysed cells faster than dl309, 

which has a wild-type El A region. When infected with rf/327 (ADF, ElA*), 94% of the cells 
were alive after S days. When cell lysis was estimated by release of lactate dehydrogenase, 
KDI and KD3 once again lysed cells faster than rf/01/07 and J/309, and rf/327 caused little 
cell lysis (data not shown). Thus, ADP is required for efficient cell lysis, and over-expression 

20 of ADP increases the rate of cell lysis. 

As another means to measure cell lysis and to examine virus replication in cancer 
cells, separate groups of A549 cells were infected with 20 PFU/cell of KDI, KD3, rf/01/07, or 
J/309 and the amount of intracellular and extracellular virus was determined by plaque assay 
on A549 cells. At 2 days p.i., the total amount of virus formed in each group was similar, 2-4 

25 X 10® PFU/ml, indicating that replication of all the viruses is similar. However, when the ratio 
of extracellular to intracellular virus was calculated, the value for KDI and KD3 was 2-3 logs 
higher than for Ad5, d/309, or dIOl/Ol (data not shown). Thus, virus is released much more 
rapidly from cells infected with KDI and KD3, which overexpress ADP, than with viruses 
expressing wild-type amounts of ADP. 

30 The ability of KDI and KD3 to ^ad from ccU-to-cell was measured in a "cell 

spreading" assay. In this assay monolayers of A549 cells in a 48 well culture dish were mock- 
infected or infected with 10 ^ 10 ^ 10'', 10^ or 10 PFU/cell of rf/327, rf/309, Ad5, rf/01/07, 
KDI or KD3. At low PFU/cell, the viruses must go through two or three rounds of 
replication in order to infect every cell m the monolayer. At 1.0 and 10 PFU/cell, the 

35 monolayer should be destroyed by the virus that initially infected the cells. To assess the 
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amount of spread in the monolayers by 7 days p.i., crystal violet, which stains live cells but 
not dead cells, was added to the monolayers. The results are shown in Fig. 5. 

Remarkably, at 7 days p.i., the monolayer was virtually ehminated by KDl and KD3 
at 10"^ PFU/cell, whereas l.O PFU/cell was required with dlOl/Ol, dl309 and Ad5. This result 
5 attests to the potency of ADP in mediating cell lysis and virus spread in A549 cells. KDl and 
KD3 are also more effective that dlOMOl in killing other types of human cancer cell lines 
(most purchased from the American Type Culture Collection [ATCC]) as determined in this 
cell spreading assay. KDl and/or KD3 killed HeLa (cervical carcinoma), DUNS (prostate), 
and pC3 (prostate) cells at 10'^ PFU/cell, ME-180 (cervix) and Hep3B (liver) at 10 * PFU/cell, 

10 and Ul 18 (glioblastoma) and U373 (glioblastoma) at 10 PFU/cell From 10- to 100-fold 
moT€ dlOMOl was required to kill these cells (data not shown). These results indicate that 
KDl and KD3 may be effective against many types of cancer. 

An important aspect of the finding that ADP overexpressing vectors lyse cells at very 
low multiplicities of infection is that the multiplicity of infection in human tumors is likely to 

IS be low at sites distal to the sight of vector injection or distal to blood vessels that carry the 
vector to the tumor. Thus, ADP overexpressing vectors have an advantage over vectors that 
express less ADP or no ADP at all. 

Example 3 

This example illustrates that KDl and KD3 replicate poorly in non-growing non- 
20 cancerous cells. The rq>lication phenotype of KDl and KD3 was evaluated using 'iiormar^ 
HEL-299 human fibroblast cells, either growing in 10% serum or rendered quiescent using 
0.1% serum. All Ads should replicate well in growing cells, but viruses with the dlOl/07 ElA 
mutation should do poorly in quiescent ceils because ElA is required to drive them out of Go. 
dl309, which has wild-type ElA, should replicate well in both growing and growth-arrested 
25 cells. 

Cells were infected with 100 PFU/cell of KDl, KD3, J/01/07, or dl309. At different 
days p.i., virus was extracted and titered. In 10% serum, KDl, KD3, and J/01/07 rephcated 
well, reaching titers of 10*-10' PFU/ml, only slightly less than rf/309 (Fig. 6). However, in 
quiescent cells, replication of KDl, KD3, and J/01/07 was 1.5-2 logs lower than in growing 

30 cells, ranging from 10^ to 2 x 10^ PFU/ml. The titer of J/309 reached 10^ PFU/ml, nearly the 
level achieved in growing cells. At 10 days p.i., quiescent HEL-299 cell monolayers infected 
with 100 PFU/cell of KDl, KD3, or J/01/07 were intact, whereas those infected with J/309 or 
J/327, which have wild-type ElA, showed strong typical Ad cytopathic effect indicative of 
cell death (data not shown). Thus, replication of KDl and KD3 is severely restricted to 

35 growing cell lines. 
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The restriction associated with the dlOMOl El A mutation was also tested in primary 
human cells (purchased firom Clonetics) growing as monolayers. Bronchial epithelial cells 
(Fig. 7) and small airway epithelial cells were not killed by 10 PFU/cell of KDl, KD3, or 
dmiOl at 5 days p.i., whereas they were killed by 10 PFU/cell of J/309 or dmi (data not 
5 shown). Lung endothelial cells also were not killed after 10 days by KDl, KD3, or dlO\/Q7 at 
10 PFU/cell, but they were killed by I PFU/cell of d/309. These monolayers were 
subconfluent when initially infected, then grew to confluency. The exciting result here is that 
although these primary cells were growing, they did not support replication in this time frame 
and were not killed by KDl or KD3. Thus, it is believed these vectors will be icstricted to 
1 0 cancerous cells, and will have little to no effect on cells such as basal cells that are normally 
dividing in the body. In addition, it is unlikely that KDl and KD3 will affect dividing 
leukocytes because such cells are poorly infected by Ad. 

In summary, the above experiments demonstrate that KDl and KD3 lyse cancer cells, 
spread from cell-to-cell rapidly, and replicate poorly in quiescent and non-cancerous cells. 
1 5 These properties should make them useful in anti-cancer therapy. 

Example 4 

This example illustrates that KDl and KD3 inhibit the growth of human tumors in an 
animal model. 

We could not evaluate mouse or rat tumors in normal mice or rats because they are 

20 totally non-permissive. Human cancer cell lines growing in nude mice have been used by 

Onyx Pharmaceuticals (Richmond, CA) to evaluate the efficacy of ONYX-015, an Ad vector 
lacking expression of the ElB 55 kDa protein (Heise et al.. Nature Med, i:639-645, 1997). 
We have found that A549 cells, which were used in many of our cell culture studies, form 
excellent rapidly growing solid tumors when injected subcutaneously into nude mice. The 

25 average tumor reaches ca. 500 fil in four weeks, and is encapsulated, vascularized, and 
attached to the mouse skin (usually) or muscle. 

Nude mice were inoculated into each hind flank with 2 x 10^ A549 cells. After 1 
week tumors had formed, ranging in size from about 20 yl to 50 ^1. Individual tumors were 
injected three days later, and at subsequent weeks for 4 weeks (total of 5 injections), with 50 

30 ^1 of buffer or 50 ^1 of buffer containing 5 x lO' PFU of J/309, dlOVOl, KDl, KD3, or 

/7m734.1, with a total virus dose per tumor of 3 x 10* PFU. The mutant pm734.1 lacks ADP 
activity due to two nonsense mutations in the gene for ADP, but all other Ad proteins arc 
expected to be synthesized at wild-type levels (Tollefson et al., 7. ViroL 7(7:2296-2306, 1996). 
The efficacy of each virus (or buffer) was tested on six tumors. At weekly intervals, the 

35 length (L) and width (W) of tumors were measured using a Mitutoyo digital caliper. Tumor 
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volumes were calculated by multiplying L x W x W/2. This value was divided by the tumor 
volume at the time of the initial virus injection, the fold-increase in tumor growth was 
calculated, and the average for the six tumors was graphed. 

As shown in Fig. 8A, tumors that received buffer continued to grow, increasing about 
5 14-fold by 5 weeks. In contrast, tumors injected with J/309, which expresses normal amounts 
of ADP and lacks the E3 RID and 14.7K and proteins, only grew about 2.5-fold by 5 weeks. 
With p/n734.1, which lacks ADP, the tumors grew as well as those that received buffer. 
Thus, J/309 markedly decreases the rate of tumor growth, and ADP is required for this 
decrease. Tumorsinoculated with rf/0 1/07 grew about 8-foId over 5 weeks. Since d/0 1/07 is 

1 0 identical to J/309 except for the El A mutation, this result indicates that the El A mutation 
significantly reduces the ability of Ad to prevent growth of the tumors. This effect is 
probably due to a reduction in virus replication in the tumors resulting in lower ADP 
expression, but it could also reflect other properties of El A in the tumor cells, e.g. the 
inability of the mutant El A proteins to induce apoptosis. Most importantly, tumors 

1 5 inoculated with KD 1 or KD3 only grew about 2.5-fold. Thus, the overexpression of ADP by 
KDl and KD3 allows KDl and KD3 to reduce tumor growth to a rate markedly slower than 
J/01/07 (their parental control virus), and even to a rate similar to that of J/309. 

The finding that KDl and KD3 are as effective as wild-type Ad (i.e. J/309) in 
reducing the rate of A549 tumor growth is highly significant in the context of cancer 

20 treatment, inasmuch as KDl and KD3 are restricted to cancer cells whereas wild-type Ad 
does not have such a restriction. 

The tumors in Fig. 8 A received five injections of vectors, but only one dose of vector, 
in this case 5x10* of each of KD3 or GZ3, is sufficient to significantly reduce the rate of 
A549 tumor growth (Fig. 8B). 

25 We have also found that KD 1 and KD3 reduce the rate of growth in nude mice of a 

human liver cancer cell line, Hep3B cells. These cells form rapidly growing tumors that are 
highly vascularized. Nude mice were inoculated into each hind flank with I x 10^ of Hep3B 
cells. After tumors reached about 100 ^1, they were injected twice per week for 3 weeks with 
50^1 of buffer or 5 x lO' PFU of KDl, KD3, or J/309. There were typically 8-10 tumors per 

30 test virus. The tumor sizes were measured and the fold increase in size at 0 to 3.5 following 
the initial virus injection was graphed as described above for the A549 tumors. Tumors that 
received buffer alone grew 9-fold over 3 weeks and were projected to grow about 12-fold 
over 3.5 weeks (after 3 weeks the mice had to be sacrificed because the tumors were 
becoming too large) (Fig. 9). Tumors that received KDl or KD3 grew about 4-fold, 

35 establishing that KDl and KD3 reduce the growth of Hep3B tumors in nude mice. Turn rs 
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that were injected with J/309 grew 2-fold (Fig. 9). The finding that KDl and KD3 were 
somewhat less effective than dl309 is probably due to the fact that they do not grow as well as 
dI309 in Hep3B cells, as indicated by a cell spread assay in culture (data not shown). In any 
case, the important points are that KDl and KD3 are effective against the Hep3B tumors, and 
S that they contain the El A mutation that limits their replication to cancer cells. 

These results point to the potency of ADP as an anti-tumor agent when expressed in 
an Ad vector. It is highly probable that KDl and KD3 will provide significant clinical benefit 
when used to infect tumors growing in humans. 

Example 5 

10 This example illustrates the use of replication-defective Ad vectors in combination 

with KDl or KD3. 

It is well established that replication-competent (RC) viruses complement replication-- 
defective (RD) mutants. That is, when the same cell is infected, the competent virus will 
supply the protein(s) that cannot be made from the mutant genome, and both viruses will 

1 5 grow. To test the ability of KDl and KD3 to complement RD viruses, two RD vectors 
expressing P-galactosidase were constructed. The first, named Ad-f3-gal, has a cDNA 
encoding p-gal under the control of the Rous Sarcoma Virus promoter substituted for the 
deleted El region. Ad-P-gal also has the E3 region deleted, including the gene for ADP. The 
second, named Ad-|3-gal/FasL is identical to Ad-P-gal, except that it also expresses murine 

20 FasL from the human cytomegalovirus promoter/enhancer. These vectors were constructed 
by overlap recombination in human 293 cells that constitutively express the Ad El A and ElB 
genes and complement replication of the El -minus vectors. 

These RD vectors should infect and express P-gal in A549 cells, but should not 
replicate because the El A proteins are lacking. However, the vectors should replicate when 

25 cells are co-infected with RC Ads. To prove this, A549 cells were infected with 10 PFU/cell 
of Ad-p-gal alone, or with 10 PFU/cell of Ad-p-gal plus 10 PFU/cell of KDl, KD3, rf/Ol/07, 
<//309, or <//327. At 2 days p.i., virus was extracted and Ad-P-gal titers determined by P-gal 
expression in A549 cells. The yields are shown in Table 2 below. 
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Table 2 



Virus 


Yield 
(blue plaques per ml) 


Ad-p-gal 


IxlO^ 


Ad-p-gal+KDl 


2x10^ 


Ad-p-gal + KD3 


3x10* 


Ad.(3-gal + <//01/07 


4x10" 


Ad-3-gal+rf/309 


3x10* 


Ad-p-gal + <//327 


3.0 X 10' 



The data in Table 2 indicate that the complementing viruses increased the yield of Ad-P-gal 
by about 10^ 

5 A key feature of KDl and KD3 is that they spread from cell-to-cell faster than other 

Ads. Accordingly, they should complement the spread of Ad-P-gal. To test this, an 
infectious center assay was conducted. A549 cells were infected with Ad-P-gal plus KDl, 
KD3, or rf/01/07. After 2 h, cells were collected, diluted, and seeded onto monolayers of 
fresh A549 cells. After 4 days, the cells were stained with X-gal and the results are shown in 
10 Fig. 10. 

With Ad-p-gal alone, only the originally infected cell (before seeding) should be 
stained, and the vector should not spread to other cells on the seeded monolayer. This was 
indeed the case. In monolayers seeded with A549 cells infected with Ad-p-gal alone (dish 
shown in the top left of Fig. lOA) contained a number of individual blue cells (not visible in 

15 the print); examples are shown in the enlarged view Fig. lOB. However, when the 

monolayers were seeded with A549 cells coinfected with Ad-P-gal and KDl or KD3, there 
were numerous "comets" of blue cells (Fig. lOA). Each comet represents Ad-P-gal which has 
spread from one initially-infected cell. Most of the cells within a comet were stained with X- 
gal (Fig. IOC). Comets were also observed with rf/01/07, but not to the extent of KDl and 

20 KD3 (Fig. lOA). With rf/327 (ADP ), there was little spread from the originally infected cell 
(data not shown). In summary, KDl and KD3 not only complement the replication of Ad-P- 
gal, they also enhance its rapid spread. 

It is expected that KDl and KD3 will also complement and enhance the spread of RD 
vectors expressing anti-cancer therapeutic gene products, and this expectation can be readily 
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verified using the Ad-p-gal/FasL in replication and infectious center assays as described 
above. 

KDl and KD3 not only complement the replication of RD vectors in cell culture, they 
also do so in HepBB tumors growing in the hind flanks of nude mice. The RD vector used 
5 was AdLuc, an Ad that lacks the El and E3 regions, and has inserted into the El region an 
expression cassette where the firefly luciferase gene is expressed from the Rous Sarcoma 
Virus promoter (Harrod et al.. Human Gene Therapy 9:1 885-1898, 1998). The HepBB tumors 
were injected with 1 x lO' PFU of AdLuc plus buffer, or 1 x lO' PFU of AdLuc plus 5 x 10^ 
PFU of KDl, KD3, rf/01/07, or ^//309. After 2 weeks, mice were sacrificed and tumors 

1 0 excised. Proteins were extracted from the tumors and luciferase activity determined using a 
luminometer. The luciferase counts per tumor were 6,800 for AdLuc plus buffer, 1 13,500 for 
KDl, and 146,900 for KD3 (Fig. 1 1). Thus, KD3 and KDl respectively caused a 22-fold and 
17- fold increase in luciferase activity. This increase could be due to elevated synthesis of 
luciferase in cells that were initially coinfected the AdLuc and KDl or KD3, and it could also 

IS be due to spread of AdLuc from cell to cell in the tumor as mediated by KDl or KD3. 

In sunmiary, infecting a tumor with a replication-competent ADP-overexpressing 
vector according to the invention together with a RD vector expressing an anti-cancer gene 
product should greatly increase the amount of anti-cancer protein synthesized in the tumor 
thereby increasing the ability of the replication-defective vector to promote destruction of the 

20 tumor. 

Example 6 

This example illustrates the construction and characterization of a recombinant Ad 
vector according to the invention which is replication-restricted to cancerous type II alveolar 
cells. 

25 As demonstrated above, the dlO\/07 mutation in KDl and KD3 limits growth of these 

vectors to cancer cells. To further restrict their replication phenotype, the E4 promoter in 
each virus was deleted and replaced by the surfactant protein B (SPB) promoter to produce 
vectors named KDl-SPB (SEQ ID NO: 14), KD3-SPB (SEQ ID N0:15), and J/01/07-SPB 
(SEQ ID NO: 16). The SPB promoter is only active in cells containing the TTFl transcription 

30 factor, which has thus far been found primarily in type II alveolar cells of the human lung 
(Lazzaro et al., Development 775:1093-1 104, 1991). Thus, KDl-SPB, KD3-SPB, and 
<//01/07-SPB should be severely restricted to cancerous type n alveolar cells of the human 
lung. Many lung cancers are of this type. 

The KDl-SPB and KD3-SPB vectors were prepared as follows. The E4 promoter is 

35 located at the right end of the Ad genome (Fig. 1). Using a pCRII-based plasmid (Invitrogen) 
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containing the Ad5 DNA sequences from the BamHI site (59 map units) to the right hand end 
of the genome, and using and a PCR-based protocol, nearly all the transcription factor binding 
sites were deleted from the E4 promoter Ad5 base pairs 35,623 to 35,775 and replaced with a 
500 base pair fragment containing the SPB promoter (Yan et al., J, Biol Chem. 270:24852- 
5 24857, 1995). The fmal plasmids contain the E4-SPB substitution in the E4 region and the 
dlOl/Oly KDl, or KD3 versions of the E3 region, respectively, for the viruses (f/Ol/07-SPB, 
KDl-SPB, and KD3-SPB. These plasmids were co-transfected into 293 cells with a fragment 
containing the left portion of the genome of rf/01/07, and plaques were allowed to develop. 
Plaques were screened for the expected features, purified, then expanded into a stock. 

1 0 The A549-TTF1 cell line was developed in order to test the prediction that replication 

of rf/Ol/07-SPB, KDl-SPB, and KD3-SPB would be restricted to cancerous cells expressing 
the TTFl transcription factor. These cells were co-transfected with two plasmids, one in 
which TTFl is expressed from the CMV promoter, and the other coding for resistance to 
neomycin Resistant clones were isolated and shown to express TTFl activity as determined 

1 5 by transient transfection with a plasmid expressing chloramphenicol acetyltransferase from 
the TTFl -requiring surfactant protein C promoter. 

KDl-SPB and KDl were subjected to a standard plaque development assay on AS49- 
TTFl cells and parental A549 cells. The results arc shown in Fig. 12, With KDl-SPB on 
A549 cells, plaques were not visible after 8 days, only about 4% of the final number of 

20 plaques were seen after 10 days, and about 50% of final plaques were seen after 12 days. 
With KDl-SPB on A549-TTF1 cells, plaques were visible after 6 days, and about 60% of 
plaques were seen after 10 days. Thus, as expected, KDl-SPB grew significantly faster on 
the cells containing TTFl . KDl formed plaques more quickly than KDl-SPB on both A549 
and A549-TTF1 cells, indicating that the E4 promoter-SPB substitution is not as effective the 

25 wild-type E4 promoter in inducing Ad replication. However, this difference between KDl- 
SPB and KDl on A549-TTF1 cells is tolerable, with KDl-SPB delayed only about 1 day. 
Curiously, the final titer obtained for all virus stocks by day 16 was similar, indicating that 
A549 cells may contain a very small amount of endogenous TTFl activity. It is predicted that 
KD3-SPB and t//01/07-SPB will behave similarly to KDl-SPB when grown in A549-TTF1 

30 cells and A549 cells. 

The restriction of KDl-SPB to cells containing TTFl was further examined in a cell 
spread assay using H441 cells, a TTFl -expressing human pulmonary adenocarcinoma cell 
line (Yan et al., supra), and Hep3B cells, a liver cancer cell line not expected to express 
TTFl. Culture dish wells containing H441 or Hep3B cells were infected with KDl-SPB or 

35 KDl at multiplicities ranging from 10 to 10"^ PFU/cell. The H441 and Hep3B cells were 
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stained with crystal violet at 5 days and 8 days p,i., respectively. KDl-SPB and KDl grew 
and spread equally well on H441 cells, causing destruction of the monolayer at 10*' PFXJ per 
cell (Fig. 13). (Some of the H441 monolayer has peeled off in the well with KDl-SPB at 10'^ 
PFU per cell, and in the wells with KDl and KDl-SPB at 10"^ PFXJ per cell; this occasionally 
5 occurs in cell spread assays, and it does not reflect virus infection). With HepBB cells, KDl 
grew and spread very much better than KDl-SPB, with 10"^ PFU per cell of KDl causing 
more destruction of the monolayer as 1.0 PFU per cell of KDl-SPB (Fig. 13). 

In summary, this example demonstrates that a replication-competent Ad, which 
replicates well on cells expressing the appropriate transcription factor, can be constructed 

10 with a tissue-specific promoter substituted in place of the E4 promoter. This methodology 
should be applicable to many other tissue specific and cell type specific promoters. One 
possibility would be a liver-specific promoter. Another possibility would be to use the E2F 
promoter, or another promoter with E2F sites, inasmuch as that promoter would be active 
only in cells such as cancer cells that have free E2F. A third possibility would be to use a 

1 5 regulatable promoter, e.g. the synthetic tetracycline response promoter (Massie et al., J. Virol 
72:2289-2296, 1998), where the activity of the promoter is controlled by the level of 
tetracycline or a tetracyclin analog in the patient. 

Example 7 

This example illustrates the construction and characterization of vectors which 
20 overexpress ADP and are not replication restricted. 

As demonstrated above, the dlOl/Ol El A mutation in KDl and KD3 is attenuating, 
inhibiting growth in non-dividing and even in dividing primary human epithelial and 
endothelial cells. Ads with this mutation are able to replicate well in dividing cancer cells. 
However, replication of such El A mutants is not as efficient as, e.g. dl309 which has a wild- 
type El A gene. For instance, the rate of replication of ^//0 1/07, as determined by the rate at 
which plaques develop, is reduced such that rf/01/07 plaques appear one day later than those 
of rf/309 (data not shown). This delay is due in part to a delay in expression of Ad late genes 
(see Fig, 3). The idea that the rf/01/07 mutation retards the rate of replication in A549 cells is 
further supported by the data in Fig. 8 A, where rf/01/07 did not prevent tumor growth nearly 
as well as rf/309. Despite this negative effect of the rf/01/07 El A mutation, there are 
theoretical and practical aspects of having this mutation in the KDl and KD3 vectors, as has 
been discussed. Nevertheless, one can easily imagine scenarios (e.g. patients with terminal 
cancer) where the ability of an Ad vector to destroy the tumor supercedes the requirement that 
the vector be totally restricted to tumor cells. In such cases, it would be advantageous to have 
vectors similar to KDl and KD3, but with the wild-type El A gene. The rates at which such 
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vectors express their genes, lyse cells, and spread from cell to cell should be higher than those 
of KDl and KD3. Such vectors might cause some damage to non-cancerous cells and tissue, 
but this is also true for other modes of anti-cancer treatment such as surgery, chemotherapy, 
and radiation therapy. 

5 In light of these considerations, vectors named GZl and GZ3 have been constructed 

that are identical to KDl and KD3, respectively, except they have a wild-type El A region. 
These vectors were constructed by overlap recombination in A549 cells. The left hand 
fragment contained the wild-type El A region of Ad5, and the right end fragment contained 
the E3 modifications of KDl or KD3. Plaques were picked, analyzed for the expected 

1 0 genotype, plaque-purified, and expanded into CsCl-banded stocks. The titers of these stocks 
on A549 cells were 2.9 x 10*° PFU/ml for GZl and 1.6 x lO'" PFU/ml for GZ3. Thus, these 
vectors can be grown into high titer stocks comparable to wild-type Ad. The GZl and GZ3 
plaques are larger and appear much sooner than the plaques for ^f/309. Large rapidly- 
appearing plaques reflect the ability of Ad to lyse cells and spread from cell-to-cell (Tollefson 

15 etal., / Virol, 7(^:2296-2306, 1996; Tollefson etal.. Virology 220:152-162, 1996), and this 
property, as discussed, is due to the fimction of ADP. 

The rate of plaque appearance can be quantitated in a plaque development assay 
(Tollefson et al., supra). Here, a typical plaque assay is performed, and the plaques observed 
on subsequent days of the assay are calculated as a percentage of the number of plaques 

20 observed at the end of the plaque assay. As shown in Fig. 14, after 4 days of plaque assay on 
A549 cells, GZl and GZ3 had 48% and 34%, respectively, of the final number of plaques, 
whereas dl209 had only 1%. It is very unusual in Ad plaque assays in A549 cells for plaques 
to appear after only 4 days. These large plaques reflect the overexpression of ADP. These 
GZl and GZ3 plaques appear sooner than those of KDl and KD3 (data not shown), no doubt 

25 because GZl and GZ3 replicate faster because they have a wild-type El A region. 

GZl and GZ3 lyse cells and spread from cell to cell much more effectively than 
rf/309. At 6 days p.i. of A549 cells, approximately as much monolayer destruction was 
observed with GZl and GZ3 at 10"^ PFU per cell as was observed with rf/309 at 10** PFU per 
cell (Fig. 15, top panel). This result fiirther underscores the conclusion that overexpression of 

30 ADP promotes cell lysis and virus spread. 

In theory, GZl and GZ3 should be able to replicate not only in tumor cells but also in 
normal cells. Although they can replicate in normal cells, it is quite possible that GZl and 
GZ3 may be usefril as anti-cancer vectors. First, GZl and GZ3 could be injected directly into 
the tumor. Many tumors are self-contained (encapsulated) except for the blood supply. The 

35 physical barriers of the tumor could minimize dissemination of the virus to other tissues. 
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Second, Ads are in general quite benign. Most infections of Ad5 are in infants and result in 
mild or asymptomatic disease, and are held in check by strong humoral and cellular 
immunity. Anti-Ad immunity appears to be life-long. GZl and GZ3 could be used only in 
patients who have an intact immune system, and perhaps also with pre-existing anti-Ad . 
5 immunity. Further, patients could be passively immunized against Ad, using gamma-globulin 
or even specific purified anti-Ad neutralizing antibodies. Third, considering that Ad5 is a 
respiratory virus which most efficiently infects lung epithelial cells displaying the specific 
Ad5 receptor (named CAR) as well as specific integrins (e.g. aybS), replication-competent 
vectors derived from Ad5 may not spread efficiently in many non-cancer tissues of the body. 
10 In addition, it is believed that versions of GZl and GZ3 can be constructed that have the £4 
promoter substituted with a tumor-specific, tissue-specific, cell-specific, or synthetic 
promoter. Such vectors would have the positive features associated with wild-type El A and 
ADP, and yet be replication-restricted to tumor tissue and/or to particular cell types. 

Example 8 

1 5 This example illustrates that the combination of KD 1 , KD3, GZ 1 , or GZ3 with 

radiation is more effective in destroying A549 cells, growing in culture or growing as tumors 
in nude mice, than the vectors alone or radiation alone. 

This was shown in a cell spread assay. A549 cells growing in three 48 well culture 
dishes were mock-infected or infected with different viruses at multiplicities of infection 

20 ranging from 1 0 to 1 PFU per cell as indicated in Fig. 15. One dish was not radiated. A 
second dish received 600 centrigreys (cGy) of radiation at 24 h p.i., and a third dish received 
2000 cGy of radiation at the same time. All dishes were stained with crystal violet at 6 days 
p.i. With the cells that were not radiated (top panel in Fig. 15), KDl and KD3 caused 
monolayer destruction at lower multiplicities of infection than their parental control, dl0\/07, 

25 This was also true for GZl and GZ3 as compared to their parental control rf/309. (The 

paucity of cells in the cells infected with GZl or GZ3 at 10"^ PFU per cell is an experimental 
artifact, and is not caused by infection by GZl or GZ3). These KDl, KD3, GZl and GZ3 
results are consistent with earlier results showing that overexpression of ADP leads to 
increased cell lysis and virus spread. 

30 With the dish that was infected then radiated with 600 cGy there was markedly 

increased cell killing and virus spread as compared to the non-radiated cells (compare the 
bottom panel of Fig. 15 with the top panel). For example, with KDl, KD3, GZl. and GZ3 
there was about the same amount of cell destruction in the radiated wells at 10*^ PFU per cell 
as in the non-radiated wells at 10*^ PFU per cell. Similar results were seen with the dish that 
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received 2000 cGy of radiation (data not shown), and also with dishes that received 600 or 
2000 cGy of radiation 24 h prior to infection (data not shown). 

The amount of cell destruction was quantitated by extracting the crystal violet from 
the cells with 33% acetic acid, tiien measuring the absorbance at 490 nm (data not shown), 
5 The absorbance with non-radiated mock-infected cells was set at 100% cell viability. With 
mock-infected cells that received 600 cGy there was a 15% loss in viability (i.e. 15% less 
crystal violet was extracted). With KDl at 10"^ PFU per cell, the non-radiated cells were 80% 
viable whereas the cells receiving 600 cGy of radiation were only about 30% viable. Similar 
differences in viability between radiated and non-radiated cells were seen with KD3, GZl, 
1 0 and GZ3. These results argue that the combination of radiation plus vector has a syngergistic 
effect on cell lysis and vector spread, rather than an additive effect. If the effect were only 
additive, then with the KDl samples at 10"^ PFU per cell, the cell viability should have been 
65% (15% reduction in viabihty due to radiation alone, 20% reduction due to KDl alone). In 
fact, the cell viability was 30% rather than 65%. 
15 As mentioned, approximately as much cell lysis and virus spread were observed with 

600 cGy as with 2000 cGy. To determine the optimal dose of radiation to synergize with the 
vectors, an experiment similar to the one described above was conducted with mock-, 
rf/01/07., KD1-, KD3-, d/309, GZ1-, or GZ3-infected A549 cells. The 48 well plates received 
0, 150, 300, or 600 cGy of radiation at 24 h p.i. Cells were stained with crystal violet. The 
20 results with cells receiving 0 versus 600 cGy of radiation were similar to those in Fig. 15. 
The crystal violet was extracted from the cells infected with 10'^ PFU per cell of the 
difference viruses. The absorbance of crystal violet was determined, and the percent cell 
viability was graphed, using the absorbance of the non-radiated mock-infected cells as 100% 
cell viability. As illustrated in Fig. 16, an approximately linear decrease in cell viability in all 
25 wells was obtained with increasing radiation dose, although the slope of the line was more 
negative with KDl, KD3, GZl, or GZ3 than with mock, d/01/07, or dl309. With KDl, KD3, 
GZl , and GZ3, there was much more cell lysis and vector spread with their parental control 
viruses, and there was synergy between the vectors and radiation. For example, with mock- 
infected cells, 600 cGy reduced cell viability by about 30% (70% of cells were viable). KDl 
30 without radiation reduced cell viability by about 23%. The combination of 600 cGy radiation 
plus KDl reduced cell viability to about 85%, more than 53% of which is the sum of radiation 
alone and KDl alone. When considering the data in Figs. 15 and 16 together, a dose of about 
600 cGy is optimal in this type of cell culture experiment. 

The combination of KD3 or GZ3 with radiation was also examined in the A549 
35 tumor-nude mouse model (see Example 4). A549 cells were injected into the hind flanks of 
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nude mice, and tumors were allowed to form. When tumors reached approximately 50-fil, 
they were injected with buffer or with 5x10^ PFU of KD3 or GZ3. Eight to ten tumors were 
injected per test condition. At 1 day p.i., half the mice received 600 cGy of whole body 
radiation. Tumor size was measured over time, and was plotted as a fold-increase in tumor 
5 size versus days p.i. (as described in Example 4). As shown in Fig. 17, the non-radiated 
buffer-injected tumors grew faster than those injected with KD3 or GZ3. Tumors that 
received the combination of KD3 and radiation did not grow, and those that received the 
combination of GZ3 and radiation shrank in size after 14 days. These results indicate that the 
combination of KD3 plus radiation or GZ3 plus radiation is more effective than either vector 
1 0 alone or radiation alone in reducing the rate of A549 tumor growth in nude mice. It is likely 
that radiation would increase the effectiveness in treating tumors of KDl and GZI, or indeed 
any other replication-competent or replication-defective Ad vector. 

The mechanism by which radiation causes the ADP overexpressing vectors to lyse 
cells and spread from cell-to-cell more effectively is not understood. Radiation is expected to 
1 S induce cellular DNA repair mechanisms, and that may allow for more efficient synthesis of 
Ad DNA. Radiation may enhance the function of ADP. ADP probably functions by 
interacting with one or more cellular proteins, and radiation may affect this protein(s) such 
that ADP functions more efficiently. 

It is believed that KDl, KD3, GZl, or GZ3, or any other replication-competent Ad 
20 vector, when used in combination with radiation, will be more effective than vector alone or 
radiation alone in providing clinical benefit to patients with cancer. The vectors should allow 
more tumor destruction with a given amount of radiation. Stated another way, radiation 
should cause more tumor destruction with a given amount of vector. These vectors should 
also allow the radiation oncologist to use less radiation to achieve the same amount of tumor 
25 destruction. Less radiation would reduce the side effects of the radiation. 

It is also believed that a cocktail of vectors when used in combination with radiation 
will be more effective than the cocktail alone or radiation alone. The cocktail could consist of 
ADP producing vectors plus one or more replication defective vectors expressing an 
anticancer therapeutic protein (see Example 5). 
30 Example 9 

This example illustrates a structure-function analysis of adenovirus death protein. 
ADP is an 1 1.6 kDa N-linked 0-linked integral membrane glycoprotein that localizes 
to the inner nuclear membrane (NM) (Scaria et al., Virology 191:743-753). As illustrated in 
Fig. 18, the Ad2-encoded ADP (SEQ ID N0:6) consists of 101 amino acids; aa 1-40 (SEQ ID 
35 NO: 1 7) are lumenal, aa 41 -59 (SEQ ID NO: 1 8) constitute the transmembrane signal-anchor 
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(SA) domain, aa 63-70 (SEQ ID NO: 19) constitute a basic proline (BP) domain within the 
nucleoplasmic (NP) domain, which constitutes aa 61-101 (SEQ ED NO:20). To determine 
which domains in ADP are required to promote cell death, a number of deletion mutants of 
reclOO were prepared which lacked various portions of the ADP gene and examined for the 
5 ability of ADP to localize to the NM and promote death. The reclOO virus is an Ad5-Ad-Ad5 
recombinant, which has been described elsewhere (Wold et ah, Virology 7^5:168-180, 1986). 

The structure of ADP in rec700 and in each deletion mutant is schematically 
illustrated in Fig. 18. The ADP gene in each deletion mutant has been sequenced using PCR 
methods to insure that the mutations are correct. The structure and activity of ADP in the 

10 deletion mutants was tested by infecting A549 cells followed by immunoblot analysis of the 
ADP mutant proteins as well as the ability to lyse cells. All deletion mutants expressed a 
stable ADP protein except pm734A (AM8, i.e. aa 1-48 are deleted). The pmllAJ (Nu) 
ADP, which has Asnn mutated to Ser, is O-glycosylated but not N-glycosylated because 
Asni4 is the only N-glycosylation site (data not shown). The dZ735 (A4-1 1) ADP is N- 

15 glycosylated but not O-glycosylated because the sites for O-glycosylation are deleted (data 
not shown). The /ww734.4 (M56) ADP, which has Metse in the SA domain mutated to Ser, 
contains exclusively N-linked high-mannose oligosaccharides (data not shown); this occurs 
because the Metse mutation precludes exit of ADP from the endoplasmic reticulum (ER). The 
£f/738 ADP, which lacks aa 46*60 in the signal-anchor domain, forms insoluble aggregates in 

20 the cytoplasm; therefore, aa 41-59 do in fact include the signal-anchor domain. The pmTiA 
(A 1-40) ADP, which initiates at Met4i the N-terminus of the SA domain, comigrated with 
the lower group of bands generated by proteolytic processing (data not shown). This 
indicates that the proteolytic cleavage sites occur near Met4i. Consistent with this, the 
proteolytic products were not seen with rf/737 (A29-45) (data not shown). Also, the size of 

25 the products decreased in all mutants with deletions within aa 41-101 (rff715.1, rf/715, J/714, 
rf/716) (data not shown). 

The. ability of these mutants to promote cell death was monitored by trypan blue 
exclusion, plaque development, and lactate dehydrogenase release assays (Tollefson et al., J. 
Virol 70:2296-2306, 1996). The trypan blue results in Fig. ISA indicate that the death- 

30 promoting function of ADP was abolished by deletion of aa MO (pm73A), aa 1 1-26 

(^/736.1), aa 18-22 (rf/735.1), or aa 4-1 1 ((//735). Mutation of the N-glycosylation site at 
Asn,4 (pm734.7) reduced the death-promoting activity to about 50% of rec700 (WT). rf/737 
(A29-45) was efficient as reclOO in promoting cell death; this indicates that the proteolytic 
processing products must not be required to promote cell death because they are not formed 

35 with dl737. The SA domain is essential for death because d/738 (A46-60) and pm734A 
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(M56) were completely defective (Fig. 19). J/715.1 was nearly completely defective, 
indicating that the BP domain is extremely important. Surprisingly, aa 71-94 (<//714), 76-89 
((//715), and 79-101 (J/716) could be deleted without affecting the death-promoting activity 
of ADP (Fig. 19). On the other hand, deletion of aa 81-88 (J/717) nearly completely 
5 abolished the activity of ADP (Fig. 19); this is probably the result of aberrant sorting of ADP 
(see below). Similar results were obtained when the ability of these ADP mutants to promote 
cell death was examined with standard plaque development, LDH-release and MTT assays. 

The effects of these mutations on the intracellular localization of ADP are extremely 
interesting. When examined by immunofluorescence (IF) at 33 h p.i. (data not shown), ADP 
1 0 from reclOO (WT) localized crisply to the NM; localization to the Golgi was also apparent. 
With dni4 (A71-94) and rf/715 (A76-89), ADP localized to all membranes, i.e. the ER, Golgi, 
plasma membrane, and NM. This was even more apparent at 45 h p.i. (data not shown) 
Thus, aa 7 1 -94 appear to include a signal that directs ADP specifically to the NM. ADP is 
very likely sorted from the trans-Golgi network (TON) to the NM, so this putative signal in 
1 5 ADP probably functions in this sorting pathway. ADP from dP 1 7 (A8 1 -88) is intriguing: it 
localized to the NM and Golgi, but in many cells "dots" and circular structures were observed. 
Again, this was more apparent at 45 h p.i. when these structures were the prominent feature. 
J/717-infectcd cells have not begun to die at 45 h p.i., so these structures are not cellular 
remnants. The intriguing possibility is that these structures are membrane vesicles that have 
20 pinched off from the TGN but are defective in targeting to and/or fusing with the NM. 

With J/738 (A46-60 in the SA domain), ADP aggregated in the cytoplasm. This 
again indicates that aa 46-60 include the SA sequence. With pm734A (M56), ADP localized 
primarily to the NM. As discussed above, the pmllAA ADP has exclusively high-mannose 
N-linked oligosaccharides, indicating that it never leaves the ER. Perhaps the putative NM- 
25 localization signal in the C-terminal region of the pmllAA ADP targets ADP to the NM by 
lateral diffusion from the ER (which is continuous with the outer and inner NM). 

With dmi (A29-45), ADP localized to the NM. ADP from pmlZA (AM0),/?ffi734.7 
(N14) (N-linked glycosylation cannot occur), and <//735 (A4-1 1; the O-glycosylation sites are 
deleted) localized much more prominently to the Golgi than the NM. ADP from rf/735.1 
30 (A 18-22) and dHZbA (Al 1-26) also localized much more strongly to the Golgi than the NM. 
Thus, residues 1-26 and/or glycosylation appear to be required for efficient transport of ADP 
from the Golgi/TGN to the NM. 

In summary, aa 41-59 include the SA domain, Met56 in the SA domain is required for 
exit from the ER, aa 1-26 are required for efficient exit from the Golgi, and aa 76-94 are 
35 required to target ADP specifically to the NM. With respect to promoting cell death, the 
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essential regions are aa 1-26, the SA domain (ADP does not enter membranes), Mets^ in the 
SA domain, and the BP domain (aa 63-70). It is not clear whether the defective death- 
promoting phenotype of pm734 (Al-40), dms (A4-1 1), J/735.1 (A18-22), dn36A (Al 1-26), 
and pml34.7 (N14) is due to lack of sequences (or oligosaccharides) that promote death or to 
5 much slower exit of ADP from the Golgi to the NM. dllU (A71-94) and f//715 (A76'89) 
express a wild-type phenotype for promoting death even though they are defective in 
localizing specifically to the NM; this is probably because sufficient ADP still enters the NM 
to promote death. Even though the deletion in dllM (A81-88) lies within the deletions in 
dlllS (A76-89) and J/714 (A71-94), the <//717 ADP is only about 15% as efficient as reclOO 
1 0 (WT), dl7 1 5 and J/7 1 4 in promoting death. This may be because the J/7 1 7 ADP tends to 

remain in vesicles rather than localizing to the NM. Altogether, these data indicate that ADP 
must localize to the NM in order to promote cell death. 

Example 10 

This example further characterizes the tissue specific Ad vectors described in Example 6. As 
1 5 discussed therein, the Ad E4 promoter is deleted and replaced with the promoter for surfactant 
protein B (SPB) in these vectors (Figure 24). 
Materials and Methods 

Cells, vectors and methods described in Example 6 were also used in this Example. 
In addition to the human cancer cell lines A549 (human lung carcinoma). Hep 3B (human 

20 hepatocellular carcinoma), and H441 (papillary lung adenocarcinoma) used in Example 6, 
HEK 293 cells (obtained fi-om Microbix (Toronto, ON)) and VKlO-9 cells were used. VklO- 
9 cells are 293 cells that in addition to El contain and express E4 and pDC. These cells will 
be referred to as 293-E4 cells. 

Experiments employing phase contrast microscopy of Hep 3B and H441 cells were 

25 performed as follows. Monolayers of Hep 3B or H44I cells were grown in 60 mm dishes 

with 5 ml of DMEM (10% FBS), and were mock-infected or infected with KDl or KDl-SPB 
at a multiplicity of infection of 10 plaque forming units (PFU) per cell. Phase contrast 
photographs of monolayers were taken at 4 and 7 days postinfection (p.i.). 

Experiments employing western blots of H441 or Hep 3B cells were performed as 

30 follows. H441 or Hep 3B cells (in 60 mm dishes) were infected with 10 PFU/cell of KDl or 
KDl-SPB. At 24 h p.i., the cells were washed three times with PBS and harvested by 
scraping. The cells were lysed by RIPA buffer. The protein concentration was measured by 
the BIO-RAD DC Protein Assay Kit (BIO-RAD Laboratories, Hercules, CA) and 10 jigof 
each sample were electrophoresed on 15% sodium dodecylsulfate polyacrylamide gels (SDS- 

35 PAGE). The gels were electroblotted onto PVDF membranes (Imm bilon, Millipore, 
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Bedford, MA). The membranes were blocked in TEST (50 mM Tris-Cl, pH 7.6, 150 mM 
NaCl, 0.2% Tween 20) containing 10% dry milk (Carnation) overnight at 4°C. After 
blocking, the membranes were incubated with a rabbit polyclonal antiserum against E40RF3 
(gift of Gary Ketner) or ADP (Tollefson et al., J. Viroi. <5d:363 3-3642, 1992), or with M73, a 
5 monoclonal antibody against El A (Harlow et al., J, ViroL 55:533-546, 1985). The secondary 
antibodies were goat anti-rabbit IgG-HRP or goat anti-mouse IgG-HRP. The blots were 
developed using the ECL protocol (Amersham Pharmacia, Arlington Heights, IL). 

Experiments employing a lactate dehydrogenase release assay for cell lysis (Tollefson 
et al., y. Virol. 70:2296-2306) were preformed as follows. H441 cells (7.7 x 10* cells per 35 

10 mm dish) and Hep 3B cells (9.0 x 10^ cells per 35 mm dish) were infected at 20 PFU/cell in 
one ml serum-free DMEM. After an adsorption period of 1 h, 3 ml of DMEM (10% FBS) 
were added (final FBS concentration of 7.5%). Cells were incubated at 37**C with 6% COj. 
At daily intervals, supematants were collected, microftiged to remove floating cells, and cell- 
free supematants were frozen at -70**C until assayed. Total lysis samples were prepared by 

1 5 addition of 1 OX lysis buffer included in the Cyto Tox 96 kit (Promega, Madison, WI). After 
all samples were collected, 20 ^1 samples were assayed in triplicate using the LDH assay kit 
Cyto Tox 96 and read on an EL340 Microplate reader (BioTecTM Instruments, Inc.) at 490 
nm. 

Experiments employing immunofluorescence evaluation of H441 and Hep 3B cells 

20 were performed as follows. H441 and Hep 3B cells were plated on Coming #1 coverslips in 
35 mm dishes. H441 (1.5 x 10^ cells/35 mm dish) and Hep 3B (9.0 x 10^ cells/35 mm dish) 
were infected with 20 PFU/cell of the indicated viruses in 1 ml serum-free DMEM. After 1 h, 
1 ml of DMEM/20% FBS was added (final concentration of 10% FBS). At the indicated 
times (48 h or 6 d p.i.), cells were fixed for 10 min in 3.7% paraformaldehyde in PBS, then 

25 permeabilized for 6 min in methanol (-20°C) and rehydrated in PBS. Coverslips were stained 
with rabbit antipeptide antiserum against the Ad E2A-coded DNA binding protein (DBP) 
(1:400 dilution; gift of Maurice Green) and mouse monoclonal antibody against fiber (1:400 
dilution; gift of Jeff Engler) or were stained with rabbit antiserum to E40RF3 (1:250 dilution; 
gift of Gary Ketner). Secondary antibodies (Cappel/ICN) were used at 1:50 dilution. All 

30 antibodies were diluted in PBS containing 1 % BS A and 0. 1 % sodium azide. Photographs 

were taken on a Nikon epifiuorescence microscope using a lOOX Planapo lens and Tmax 400 
film (Kodak). The film was developed in Diafine developer. 

Analysis of viral DNA replication by Southern hybridization was performed as 
follows. H441 and Hep 3B cells were grown in 60 mm dishes in DMEM supplemented with 

35 10% FBS. Cells were infected at 70% confluence with 10 PFU/cell of KDl or KDl-SPB. 
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Dishes were incubated in humidified 5% CO2 atmosphere at 37°C. Total genomic DNAs 
were isolated at 5, 24, 48, 72, and 96 h p.i. Equal amounts of total genomic DNAs were 
digested with Hindlll and resolved on a 1% agarose gel prior to transfer ont membranes. A 
random primer "P-labeled pBHGlO plasmid probe (Bett et al., Proc, Natl Acad, ScL USA 
5 P7:8802-88O6, 1994) was used for hybridization, and the blots were autoradiographed. DNA 
fragments were quantitated on a Molecular Dynamics Phosphorlmager. 

Virus yields were determined as follows. Hep 3B cells or H441 cells grown as 
monolayers in 35 mm dishes were infected with 10 PFU/ccll of KDl or KDl-SPB. At days 0 
to 4 (for H441) or days 0 to 9 (for Hep 3B) pj., cells and culture medium were frozen at - 
1 0 70^C. Samples were frozen and thawed three times to release the virus from the cells, and 
total virus yields were determined by plaque assay on A549 monolayers. 

The effect of KDl-SPB and KDl on H441 and Hep 3B tumors was examined in a 
nude mouse model (Doronin et al., J. Virol 7^:6147-6155, 2000). Tumor cells (10^ cells in 
200 \s\ of DMEM, 50% Matrigel [Becton Dickinson Labware, Bedford, MA] for H441 cells, 
1 5 or 10^ cells in 200 \A of DMEM plus 10% Matrigel for Hep 3B cells) were injected into flanks 
of 5-6 weeks old athymic nude mice and allowed to grow for three weeks to about 100 
(H441) or 150 fil (Hep 3B) volumes. Pre-established tumors (n = 10) were injected with 50 ^l 
of DMEM or 5 X 10^ PFU of indicated viruses in DMEM. Injections of the viruses were 
repeated twice weekly for 3 weeks to the total dose of 3.0 x 10* PFU per tumor. Tumor size 
20 measurements were taken twice per week for H441 cells, or weekly for Hep 3B cells using a 
Sylvac digital caliper. Tumor volumes were calculated in according to the formula: length x 
width^ /2. Data are represented as means of increase in tumor size relative to the tumor size at 
the initial injection. 
Results 

25 The properties of KD 1 -SPB in various cell types were compared to those of its 

"parent'*, KDl. Figure 25 shows the plaque development properties of these vectors on 293- 
E4, 293, and A549 cells. The data are plotted as the number of plaques seen on any day of 
the plaque assay as a percentage of the number of plaques seen at the end of the assay (i.e. 
when new plaques cease to appear) (Tollefson et al., y. Firo/. 70:2296-2306, 1966). This 

30 assay is an indicator of the size of the plaques. KDl formed plaques equally well on 293-E4 
and 293 cells (Figure 25 A). With KDl-SPB, plaques were observed about 3-4 days sooner on 
293-E4 compared to 293 cells (Fig. 2A). On A549 cells, KDl formed plaques 4-6 days 
sooner than KDl-SPB (Figure 25B). 

The properties of KDl-SPB versus KDl were characterized in detail in H441 cells, a 

35 human papillary lung adenocarcinoma cell line known to express the TTFl transcription 
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factor and in which the SPB promoter is active (Yan et al., / BioL Chem. 270:24852-24857, 
1995). Hep 3B ceils, a human hepatocellular carcinoma in which the SPB promoter should 
not be active, were used as a negative control. H441 and Hep 3B monolayers were infected 
with 10 PFU/cell of KDl or KDl-SPB and photographed at 4 and 7 days p.i. Mock-infected 
5 Hep 3B cells formed a relatively homogeneous monolayer, but H441 cells tended to form 
structures that resemble syncytia (Figure 26 A, B). As expected, KDl produced cytopathic 
effect (CPE) on both cell lines at 4 and 7 days p.i. (Figure 26A. B). Also as expected, KDl- 
SPB caused CPE on H441 cells but not on Hep 3B cells. Since CPE in Ad-infected cells is 
usually an indicator of virus growth, these results suggest that KDl-SPB grows in H441 but 
10 not in Hep 3B cells. 

To examine viral DNA replication, H441 and Hep 3B cells were infected with 10 
PFU/cell of KDl or KDl-SPB, then the accumulation of viral DNA was determined by DNA 
blot. With H441 cells, KDl and KDl-SPB DNAs were readily detected at similar levels at 
48-96 h p.i. (Figure 27A). With Hep 3B cells, KDl DNA levels were similar to those in 
15 H441 cells, but KDl-SPB DNA was barely detectable. This was confirmed by 
Phosphorlmager analysis of the DNA bands (Figure 27B). 

Growth of KDl-SPB and KDl in H441 and Hep 3B cells was determined by a single 
step growth assay. Cells were infected with 10 PFU/cell of vector, then total vector yield was 
determined by plaque assay. Total yield of both vectors was similar in H441 cells, reaching a 
20 plateau after 2 days (Fig. 28A). KDl yield plateaued in Hep 3B cells after 2-4 days p.i. 

(Figure 28B). However, KDl-SPB levels were about 5 logs lower in Hep 3B cells after 2-4 
days, and even by 9 days they had not achieved the levels of KDl. We conclude that KDl- 
SPB grows with significant specificity on H441 versus Hep 3B cells. Further, KDl-SPB 
grows as well as KDl on H441 cells, indicating that the E4 promoter deletion by itself does 
not significantly compromise the vector, and that the E4 promoter can be replaced by a tissue- 
specific promoter in a replication-competent vector. 

To obtain further details on the replication of KDl-SPB vs KDl in H441 and Hep 3B 
cells, the expression of representative Ad proteins by KDl-SPB and KDl was examined. 
H441 or Hep 3B cells were mock-infected or mfected with 10 PFU/ml of KDl or KDl-SPB, 
then at 24 h p.i. the proteins were extracted and the El A, E40RF3, and ADP proteins were 
examined by immunoblot. E40RF3 is one of the six proteins coded by the E4 transcription 
unit(Leppard,/ Gen. ViroL 75:2131-2138, 1997). As anticipated, KDl-SPB expressed 
E40RF3 well in H441 cells, but only at trace levels in Hep 3B cells (Figure 29). KDl-SPB 
expressed the ElA proteins in Hep 3B cells. Synthesis of ElA proteins by KDl-SPB in Hep 
3B cells is expected because ElA expression does not require E4 proteins; it also indicates 
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that the block to infection with KDl-SPB is downstream of El A. KDl expressed ElA in 
both cell lines, but the amount was less than obtained with KDl-SPB in Hep 3B cells (Figure 
29). The increased ElA levels seen with KDl-SPB may reflect its poor ability to enter the 
late phase of infection (see Discussion). KDl-SPB expressed ADP as well as KDl in H441 
5 cells, but it did not make detectable ADP in Hep 3B cells. ADP is primarily a late protein, so 
this result is consistent with the relative lack of E4 protein expression, DNA replication, and 
growth of KDl-SPB in Hep 3B cells. 

To gain insights into replication events that occur in individual cells, expression of 
E40RP3, the E2A-DBP, and the fiber late protein was examined by immunofluorescence, 

10 H441 or Hep 3B cells were infected with 20 PFU/celL At 48 h or 6 days p.i., cells were fixed 
and inununostained. E40RF3 was detected in the nuclei of H441 cells at 48 h p.i. with KDl, 
KDl-SPB. or dl309 (Figure 30A). (dl309 is an Ad5 mutant that has wild-type ElA, expresses 
Ad5 levels of ADP. and lacks the E3-RID and E3-14.7K genes). E40RF3 could not be 
detected in the vast majority of Hep 3B cells infected with KDl-SPB (Figure 30A). even at 6 

1 5 days p.i. (Figure 30B). Thus, KDl-SPB expresses E40RF3 well in H441 but not in Hep 3B 
cells. 

Figure 31 A shows double label immunofluorescence of DBF and fiber in the same 
Hep 3B cells at 48 h p.i. with KDl or KDl-SPB. With KDl, there was a strong speckled 
staining pattern in the nucleus that is typical for DBP at 48 h p.i. (Figure 31 A, top left panel). 

20 There was strong staining of fiber throughout these same cells (Figure 3 1 A, top right panel). 
Staining of the cytoplasm and nucleus is expected because fiber is synthesized in the 
cytoplasm and then transported to the nucleus where virions assemble. With KDl-SPB at 48 
h p.i., about 25% of the cells showed the speckled staining for DBP, and only one cell (7% of 
total) with the advanced speckled pattern was also stained for fiber (Figure 3 1 A, bottom two 

25 panels). Even at 6 days p.i., only about 30% of cells showed staining for DBP, and about 
20% for fiber (Figure 3 IB). Thus, markedly fewer Hep 3B cells infected with KDl-SPB 
expressed DBP and especially fiber as compared to KDl . These results indicate that KDl- 
SPB replicates as well as KDl in H441 cells, no doubt because the SPB promoter is active in 
H441 cells (Yanetal.,/ Bioi Chem, 27(7:24852-24857, 1995). KDl-SPB barely replicates 

30 in Hep 3B cells, presumably because the SPB promoter is minimally active in these cells. 

At the culmination of replication. Ad-infected cells are lysed and the virus spreads to 
other cells; this process is mediated in large part by ADP (ToUefson et al.. Virology 220:152- 
162, 1996; ToUefson etal.,y. ViroL 70:2296-2306. 1996). To examine vector-induced cell 
lysis, H441 and Hep 3B cells were mock-infected or infected with 20 PFU/cell of KDl, KDl- 

35 SPB, or dl309, and cell lysis was determined by release of lactate dehydrogenase (Tollefson et 
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al., J. Virol 7(?:2296-2306, 1996). All vectors lysed H441 cells beginning at 2-3 days p.i. 
(Figure 32A). KDl and dl309 also lysed Hep 3B cells in the same time period; however, 
KDl-SPB caused only minimal cell lysis (Figure 9B). Thus, these data, along with the cell 
spread data in Example 6 and Figure 13, demonstrate that KDl-SPB lyses cells and spreads 
5 efficiently from cell-to-cell in H441 but not Hep 3B cells. 

An experiment was conducted to determine whether KDl-SPB or KDl would 
suppress H441 tumors in nude mice. H441 cells were injected into each hind flank. When 
tumors had grown to about 100 (H44 1 ) or 1 50 ^il (Hep 3B), they were injected twice 
weekly for 3 weeks with DMEM (mock) or 5 x lO' PFU of test virus in 50 jil of DMEM (3.0 

10 X 10* total PFU). Ten tumors (5 mice) were used for each virus. Growth of H441 tumors was 
suppressed similarly by KDl-SPB and KDl (Figure 33 A). KDl suppressed growth of Hep 
3B tumors, whereas KDl-SPB caused only minimal suppression (Figure 33B). These results 
show that KDl-SPB is as effective as KDl in suppressing tumors when the SPB promoter is 
active. Further, the cell type specificity observed with KDl-SPB in vitro is maintained in 

15 vivo. 

Discussion 

Tumor specificity is one of the biggest challenges facing canca gene therapy, i.e. 
having the therapeutic gene be expressed specifically in cancer cells. Specificity is very 
important for RC viruses. Two main strategies have been described that in theory confer 

20 specificity: transductional targeting and transcriptional targeting. Directing specificity of 
vectors toward specific cell surface receptors on the target cells has been attempted through 
various methods. Although this approach is theoretically attractive it might encounter 
multiple obstacles such as the lack of incorporation of the engineered protein into the virion 
(Scaria et al.. Virology ] 91:143^753, 1992) or lack of infectivity through the targeted receptor 

25 (Cosset et al., J. Virol. dP:63 14-6322, 1995). Transcriptional targeting utilizes tumor and 

tissue specific promoters. In replication-defective vectors these regulatory sequences confine 
the expression of cytotoxic genes to specific tissues. In replication-competent vectors, as an 
added layer of regulation, vector replication per se can be placed under the control of tumor or 
tissue specific promoter/enhancer sequences. In replication-competent Ad, insertion of the 

30 tissue or tumor specific promoter/enhancer into the E 1 A promoter/enhancer region has been 
used exclusively (Hallenbeck et al., Hum, Gene Ther. 70:1721-1733, 1999; Rodriguez et al.. 
Cancer Res, 57:2559-2563, 1997; Yu et al.. Cancer Res. 59, 4200-4203, 1999; Yu et al., 
Cancer Res. 5P:1498-1504, 1999). The rationale behind these vectors is that expression of 
El A and therefore the whole Ad transcription program will depend on these tissue or tumor 

35 specific promoters. However, as a generic approach, there may be difficulties. The El A 
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enhancer/promoter is very complex. The enhancer controls not only the El A promoter but 
also distant promoters such as the E4 promoter (Shenk, T. pp. 21 11-2148 In B.N. Fields, 
D.M. Knipe, and P.M. Howley (eds.), Fields Virology, Lippincott-Raven, Philadelphia, 
1996). In addition, it has been shown that the El A enhancer in the inverted terminal repeat 
5 region changes tissue specificity of cellular promoters (Shi et al., Hum. Gene Ther. S'AOZ- 
410, 1997). Also, the El A enhancer/promoter is partially embedded within the signals 
required to package the Ad genome into virions, and it may be problematic to remove all the 
El A enhancer elements without impairing virus production. Accordingly, we chose to 
replace the E4 promoter with a tissue specific promoter. E4 genes are essential for Ad 
1 0 rephcation, and therefore we expected that the replication of the recombinant virus would be 
dependent on the tissue specific regulatory elements. 

To construct KDl-SPB, the ca. 300 bp of the E4 promoter was deleted and the B-500 
version (ca. 500 bp) of SPB promoter was inserted (Yan et al., siq)ra) (Figure 24 C, D). We 
selected the SPB promoter because of its strict tissue specificity: it is exclusively active in 
1 5 type II alveolar cells and bronchial epithelial cells of the lung (Bohinski et al., 1994, MoL 
Cell Biol 7^:5671-5681, 1994). Since the parental virus KDl contains and expresses two 
El A mutations that restrict virus replication to tumor cells (Doronin et al., supra\ we 
anticipated that the virus would selectively replicate in cells derived from lung tumors. Thus, 
H441 cells, a papillary lung carcinoma cell line, were used to characterize the replication, 
20 gene expression, and functional profile of KDl -SPB. 

KDl-SPB formed plaques 3-4 days sooner on 293-E4 cells that express E4 proteins 
than on 293 cells, whereas KDl formed plaques with the same kinetics on both cell lines. 
These data show that the E4 promoter is active in 293 cells, and that the SPB promoter 
displays very low activity in 293 cells. It is not clear why KDl-SPB forms plaques on 293 
25 cells; these cells are derived from human embryonic kidney and at least one of the 

transcription factors regulating the SPB promoter (Bohinski et al., supra), hepatocyte nuclear 
factor 3, is expressed in embryonic kidney. It is also possible that TTFl, the master 
regulatory factor of SPB expression, is minimally active in 293 cells. 

KDl grew to equally high titers in H441 and Hep 3B cells (Figure 28A, B). In 
30 contrast, KDl-SPB replicated as efficiently as KDl in H441 cells, in which the SPB promoter 
is active (Yan et al., supra) (Figure 28A), but replicated poorly in Hep 3B cells, most likely 
because the SPB promoter is inactive (Figure 28B). This selectivity has been confirmed by 
measuring viral DNA production in the two cell lines. KDl-SPB DNA replication was 
similar both kinetically and quantitatively to KDl DNA replication in H441, however in Hep 
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3B cells, KDl-SPB DNA was almost undetectable (Figure 27 A, B). The cytopathic effect, a 
surrogate marker of Ad replication, showed a similar specificity (Figure 26). 

To further confirm our predictions on the molecular basis of the observed issue 
specificity we monitored viral protein expression. When cells were infected with KDl-SPB 
5 all the viral proteins early or late, except for El A, were expressed in a tissue-specific fashion 
(high expression in H441, low to undetectable expression in Hep 3B) (Figures 29-31). We 
foimd a good correlation between the levels of E4 promoter activity (E40RF3 expression) 
and the expression of E2A-DBP, ADP, and fiber proteins. Thus, the SPB promoter retains its 
tissue specificity in the Ad genome and it seems to be the limiting factor of Ad gene 

10 expression in the cell lines tested. As expected, expression of El A is not tissue-specific. 
Thus, the regulatory step of tissue-specific Ad DNA replication is downstream of El A. In 
Hep 3B cells, KDl-SPB expressed El A at a higher level than did KDl (Figure 29), strongly 
suggesting that KDl-SPB replication in most of the Hep3B cells remains at the early stage. 
The cytolytic effect of KDl-SPB also showed a tissue-specific profile (Figure 32; 

15 Figure 13 of Example 6), i.e., preferential lysis of H441 cells over Hep 3B cells, a pattern 
similar to the specificity observed at the level of DNA replication (Figure 27) and viral 
protein synthesis (Figures 29-3 1). This cell type specificity was also observed when these 
cells were growing as tumors in nude mice. Growth of H441 tumors was suppressed by KDl- 
SPB and KDl at similar efficacy (Figure 33 A). In contrast, KDl-SPB unlike KDl had only 

20 minimal effect on the growth of Hep 3B tumors (Figure 33B). 

In simmiary, substitution of the E4 promoter with a tissue specific promoter allows 
highly tissue specific replication of Ad vectors and in the target tissue it is as efficient as the 
replication of the parental virus. KDl-SPB lacks all E3 genes except ADP. E3 gpl9K, RID 
and 14.7K have been shown to protect Ad-infected cells fi'om attack by cytotoxic 

25 lymphocytes and apoptosis-inducing cytokines such as tumor necrosis factor and Fas ligand 
(Wold et al., pp. 200-232 In A.J. Cann (ed.), DNA Virus Replication: Frontiers in Molecular 
Biology, Oxford University Press, Oxford, 2000; Wold et al., Curr. Opin, Immunol 77:380- 
386, 1999). 

The therapeutic index (virus produced in H441 cells compared to Hep 3B cells) of 
30 KDl-SPB is 10*-10^ for the first 4-5 days (Figure 28). These data compare to data reported 
by Calydon (10^-10^ for their prostate specific viruses (Rodriguez et al, supra\ Yu et al.. 
Cancer Res, 59, 4200-4203, 1999; Yu et al.. Cancer Res. 5P:1498-1504, 1999). We suggest 
that KDl-SPB has some added advantage over vectors reported by other laboratories because 
it encodes a mutant form of El A that restricts replication to cancer cells (Doronin et al., 
35 supra). 
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Although the lung ranks as the second highest cancer site for both men and women in 
the U.S. Reis et al., Cancer Res. 55:2398-2424, 2000), lung cancer has not been a major target 
for cancer vector gene therapy since intratumoral injection of virus is generally not feasible in 
the lungs. However, there has been a recent report of intratumor injection of a replication- 
S defective Ad vector into a lung tumor, and such an approach could be attempted with KDl- 
SPB. It may also be feasible to administer KDl-SPB systemically in the lung. 

In view of the above, it will be seen that the several advantages of the invention 
are achieved and other advantageous results attained. 

As various changes could be made in the above methods and compositions 
1 0 without departing from the scope of the invention, it is intended that all matter contained in 
the above description and shown in the accompanying drawings shall be interpreted as 
illustrative and not in a hmiting sense. 

All references cited in this specification, including patents and patent 
applications, are hereby incorporated by reference. The discussion of references herein is 
IS intended merely to summarize the assertions made by their authors and no admission is made 
that any reference constitutes prior art. Applicants reserve the right to challenge the accuracy 
and pertinence of the cited references. 
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What is Claimed Is : 

1 . A recombinant vector which is replication-competent in a neoplastic cell and 
which overexpresses an adenovirus death protein. 

2. The recombinant vector of claim 1 wherein the adenovirus death protein 
comprises amino acids 1-26, 41-59. and 63-70 of SEQ ID N0:5, SEQ ID N0:6, SEQ ID 
NO: 7, or SEQ ED N0:8 or a conservatively substituted variant thereof or wherein the 
adenovirus death protein comprises SEQ ID NO:5, SEQ ID NO:6, SEQ ID N0:7, or SEQ ID 

5 NO:8. 

3. The recombinant vector of claim 2 which comprises a recombinant virus. 

4. The recombinant vector of claim 3, wherein the recombinant virus is an 
adenovirus lacking expression of at least one E3 protein selected from the group consisting of: 
gpl9K; RIDa; RIDp and 14.7K. 

5 . The recombinant vector of claim 4 which comprises SEQ ID N0:3 or SEQ 

IDNO:4. 

6. The recombinant vector of claim 3 which is replication-restricted to 
neoplastic cells. 

7. The recombinant vector of claim 6 which comprises SEQ ID NO: 1 or SEQ 

rDNO:2. 

8. The recombinant vector of claim 3, wherein the recombinant adenovirus 
comprises a tissue specific promoter, a tumor specific promoter, or an inducible promoter 
substituted for the E4 promoter. 

9. The recombinant vector of claim 8, wherein the tissue-specific promoter is a 
surfactant protein B promoter. 

10. The recombinant vector of claim 6 which comprises SEQ ID NO: 14, SEQ ID 
NO: 15 or SEQ ID NO: 16. 

1 1 . The recombinant vector of claim 1 , wherein the vector further comprises a 
gene encoding an anti-cancer product. 

12. The recombinant vector of claim 11, wherein the gene encoding an anti- 
cancer product is in the E3 region of the vector. 

13. A method for promoting death of a neoplastic cell comprising contacting the 
neoplastic cell with at least one vector which is replication competent in the neoplastic cell 
and which overexpresses an adenovirus death protein. 

14. The method of claim 13 wherein the adenovirus death protein comprises 
amino acids 1-26, 41-59, and 63-70 of SEQ ID N0:5, SEQ ID NO:6, SEQ ID N0:7, or SEQ 
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ID NO:8 or a conservatively substituted variant thereof or wherein the adenovirus death 
protein comprises SEQ ID N0:5, SEQ ID NO:6, SEQ ID N0:7, or SEQ ID NO:8. 

15. The method of claim 14, wherein the vector comprises a recombinant 
adenovirus lacking expression of at least one E3 protein selected from the group consisting of: 
gpl9K; RIDa; RIDp and 14.7K. 

16. The method of claim 15, wherein the neoplastic cell comprises a tumor in a 
patient and the contacting step comprises administering the recombinant adenovirus to the 
tumor. 

17. The method of claim 16, further comprising the step of passively immunizing 
the patient against the recombinant adenovirus. 

18. The method of claim 17, wherein the recombinant adenovirus comprises SEQ 
IDNO:3orSEQIDNO:4. 

19. The method of claim 15, wherein the vector is replication-restricted to 
neoplastic cells. 

20. The method of claim 19, wherein the vector is a recombinant adenovirus 
comprising SEQ ID NO: 1 or SEQ ID NO:2. 

21 . The method of claim 15, wherein the recombinant adenovirus comprises a 
tissue specific promoter or an inducible promoter substituted for the E4 promoter. 

22. The method of claim 21, wherein the tissue specific promoter is a surfactant 
protein B promoter. 

23. The method of claim 22, wherein the recombinant adenovirus comprises SEQ 
ID NO: 14, SEQ ID NO: 1 5 or SEQ ID NO: 1 6. 

24. The method of claim 16, further comprising treating the tumor with radiation. 

25. The method of claim 24, comprising administering more than one 
recombinant adenovirus to the tumor and treating the tumor with radiation. 

26. The method of claim 16, further comprising treating the tumor with 
chemotherapy. 

27. The method of claim 26, comprising administering more than one 
recombinant adenovirus to the tumor and treating the tumor with chemotherapy. 

28. The method of claim 16, further comprising administering to the tumor one 
or more replication-defective adenovirus which expresses an anti-cancer gene product, 
wherein the recombinant adenovirus complements spread of the replication-defective 
adenovirus in the tumor. 

29. A composition comprising: 
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a first recombinant virus which is rq)lication competent in a neoplastic cell and 
overexpresses an adenovirus death protein; and 

a second recombinant virus which is replication defective and which expresses an 
5 anti-cancer gene product, 

wherein the first recombinant virus complements replication of the second recombinant virus. 

30. The composition of claim 29 wherein the first recombinant virus comprises a 
recombinant adenovirus lacking expression of at least one £3 protein selected from the group 
consisting of: gpI9K; RIDa; RIDp and 14.7K. 

3 1 . The composition of claim 30 wherein the recombinant adenovirus comprises 
a nucleotide sequence selected from the group consisting of: SEQ ID N0:1; SEQ ID N0:2; 
SEQ ID NO:14; SEQ ID N0:15; SEQ ID NO:16; SEQ ED N0:3; or SEQ ID NO:4. 

32. A composition comprising 

a first recombinant virus which is replication-defective in a neoplastic cell 
and which overexpresses an adenovirus death protein, and 

a second recombinant virus which is replication-competent in a neoplastic 

cell. 
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LOCUS adS comple 35935 bp DNA SYN 06-FEB-1999 

DEFINITION ad5 complete genome 
ACCESSION ad5 compXe 
KEYWORDS 

SOURCE Unknown . 

ORGANISM Unknown 

Unclassified. 
REFERENCE 1 (bases 1 to 35935) 

AUTHORS Self 

JOURNAL Unpubl i shed . 
BASE COtJNT 8367 a 10073 c 9761 g 7734 t 

ORIGIN 

1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG GGGGTGGAGT 
61 TTGTCACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG TAGTAGTGTG GCGGAAGTGT 
121 GATGTTGCAA GTGTGGCGGA ACACATGTAA GCGACGGATG TGGCAAAAGT GACGTTTTTG 
181 GTGTGCGCCG GTGTACACAG GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG 
241 TAAATTTGGG CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 
301 AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCG CGTAA TATTTGTCTA GGGCCGCGGG 
361 GACTTIX3ACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT CTCAGQTGTT TTCCGCGTTC 
421 CGGGTCAAAG TTGGCGTTTT ATTATTATAG TCAGCTGACG TG TAGTG TAT TTATACCCGG 
481 TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCCGC 
541 TCCGACACCG GGACTGAAAA TGAGACATAT TATCTGCCAC GGAGGTGTTA TTACCGAAGA 
601 AATGGCCGCC AGTCTTTTGG ACCAGCTGAT CGAAGAGGTA CTGGCTGATA ATCTTCCACC 
661 TCCTAGCCAT TTTGAACCAC CTACCCTTCA CGAACTGTAT GATTTAGACG TGACGGCCCC 
721 CGAAGATCCC AACGAGGAGG CGGTTTCGCA GATTTTTCCC GACTCTGTAA TGTTGGCGGT 
781 GCAGGAAGGG ATTGACTTAC TCACTTTTCC GCCGGCGCCC GGTTCTCCGG AGCCGCCTCA 
841 CCTTTCCCGG CAGCCCGAGC AGCCGGAGCA GAGAGCCTTG GGTCCGGTTT CTATGCCAAA 
901 CCTTGTACCG GAGGTGATCG ATCTTACCTG CCACGAGGCT GGCTTTCCAC CCAGTGACGA 
961 CGAGGATGAA GAGGGTGAGG AGTTTGTGTT AGATTATGTG GAGCACCCCG GGCACGGTTG 
1021 CAGGTCTTGT CATTATCACC GGAGGAATAC GGGGGACCCA GATATTATGT GTTCGCTTTG 
1081 CTATATGAGG ACCTGTGGCA TGTTTGTCTA CAGTAAGTGA AAATTA TGGG CAG TGGG TGA 
1141 TAGAGTGGTC GGTTTGGTOT GGTAATTTTT TTTTTAATTT TTACAGTTTT GTGGTTTAAA 
1201 GAATTTTGTA TTGTGATTTT TTTAAAAGGT CCTGTGTCTG AACCTGAGCC TGAGCCCGAG 
1261 CCAGAACCGG AGCCTGCAAG ACCTACCCGC CGTCCTAAAA TGGCGCCTGC TATCCTGAGA 
1321 CGCCCGACAT CACCTGTGTC TAGAGAATGC AATAGTAGTA CGGATAGCTG TGACTCCGGT 
1381 CCTTCTAACA CACCTCCTGA GATACACCCG GTGGTCCCGC TGTGCCCCAT TAAACCAGTT 
1441 GCCGTGAGAG TTGGTGGGCG TCGCCAGGCT GTGGAATGTA TCGAGGACTT GCTTAACGAG 
1501 CCTGGGCAAC CTTTGGACTT GAGCTGTAAA CGCCCCAGGC CATAAGGTGT AAACCTGTGA 
1561 TTGCGTGTGT GGTTAACGCC TTTGTTTGCT GAATGAGTTG ATGTAAGTTT AATAAAGGGT 
1621 GAGATAATGT TTAACTTGCA TGGCGTGTTA AATGGGGCGG GGCTTAAAGG GTATA TAATG 
1681 CGCCGTGGGC TAATCTTGGT TACATCTGAC CTCATGGAGG CTTGGGAGTG TTTGGAAGAT 
1741 TTTTCTGCTG TGCGTAACTT GCTGGAACAG AGCTCTAACA GTACCTCTTG GTTTTGGAGG 
1801 TTTCTGTGGG GCTCATCCCA GGCAAAGTTA GTCTGCAGAA TTA AGGAG GA TTACAAGTGO 
1861 GAATTTGAAG AGCTTTTGAA ATCCTGTGGT GAGCTGTTTG ATTCTTTGAA TCTGGGTCAC 
1921 CAGGCGCTTT TCCAAGAGAA GGTCATCAAG ACTTTGGATT TTTCCACACC GGGGCGCGCT 
1981 GCGGCTGCTG TTOCTTTTTT GAGTTTTATA AAGGATAAAT GGAGCGAAGA AACCCATCTG 
2041 AGOGGOGGGT ACCTGCTGGA TTTTCTGGCC ATGCATCTGT GGAGAGCGGT TGTGAGACAC 
2101 AAQAATCGCC TGCTACTGTT GTCTTCCGTC CGCCCGGCGA TAATACOGAC GGAGGAGCAG 
2161 CAGCAGCAGC AGOAGGAAGC CAGGCGGOGG CGGCAGGAGC AGAGCCCATG OAACCOGAGA 
2221 GCCGGCCTGG ACCCTCGGGA ATGAATGTTG TACAGGTGGC TGAACTGTAT CCAGAACTGA 
2281 GACGCATTTT GACAATTACA GAGGATGGGC AGGGGCTAAA GGGGGTAAAG AGGGAOCGGG 
2341 GGGCTTCTGA GGCTACAGAG GAGGCTAGGA ATCTAGCTTT TAGCTTAATG ACCAGACACC 
2401 GTCCTGAGTG TATTACTTTT CAACAGATCA AGGATAATTG CGCTAATGAG CTTGATCTGC 
2461 TGGOGCAGAA GTATTCCATA GAGCAGCTGA CCACTTACTG GCTGCAGCCA GGGGATGATT 
2521 TTGAGGAGGC TATTAGOQTA TATGCAAAGO TGGCACTTAG GCCAGATTGC AAGTACAAOA 
2581 TCAGCAAACT TGTAAATATC AGGAATTOTT GCTACATTTC TGGGAACGGG GCCGAGOTGG 
2641 AGATAGATAC GGAGGATAGG GTGGCCTTTA GATGTAGCAT OATAAATATG TGOCCGGGOO 
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2701 TGCTTGGCAT GGACGGGGTG GTTATTATGA ATGTAAGGTT TACTGGCCCC AATTTTAGCG 
2761 GTACGGTTTT CCTGGCCAAT ACCAACCTTA TCCTACACGG TGTAAGCTTC TATGGGTTTA 
2821 ACAATACXTTG TGTGGAAGCC TGGACCGATG TAAGGGTTCG GGGCTGTGCC TTTTA CTGCT 
2881 GCTGGAA6GG GGTGGTGTCT CGCCCCAAAA GCAGGGCTTC AATTAAGAAA TGCCTCTTTG 
2941 AAAGGTGTAC CTTGGGTATC CTGTCTGAGG GTAACTCCAG GG7GCGCCAC AATGTGGCCT 
3001 CCGACTGTGG TTGCTTCATG CTAGTGAAAA GCGTGGCTGT GATTAAGCAT AACATGGTAT 
3061 GTGGCAACTG CGAGGACAGG GCCTCTCAGA TGCTGACCTG CTCGGACGGC AACTGTCACC 
3121 TGCTGAAGAC CATTCACGTA GCCAGCCACT CTCGCAAGGC CTGGCCAGTG TTTGAGCATA 
3181 ACATACTCAC CCX3CTGTTCC TTGCATTTGG GTAACAGGAG GGGGGTGTTC CTACCTTACC 
3241 AATGCAATTT GAGTCACACT AAGATATTGC TTGAGCCCX3A GAGCATGTCC AAGGTGAACC 
3301 TGAACGGGGT GTTTCACATG ACCATGAAGA TCTGGAAGGT GCTGAGGTAC GATGAGACCC 
3361 GCACCAGGTG CAGACCCTGC GAGTGTGGCG GTAAACATAT TAGGAACCAG CCIGTGATGC 
3421 TGGATGTGAC CGAGGAGCTG AGGCCCGATC ACTTGGTGCT GGCCTGCACC CGC3GCT0AGT 
3481 TTGGCTCTAG CGATGAAGAT ACAGATTGAG GTACTGAAAT GTGTGGGCGT GGCTTAAGGG 
3S41 TGGGAAAGAA TATATAAGGT GGGGGTCTTA TGTAGTTTTG TATCTGTTTT GCAGCAGCCX3 
3601 CCGCCGCCAT GAGCACCAAC TCGTTTGATG GAAGCATTGT GAGCTCATAT TTGACAACGC 
3661 GCATGCCX:CC ATGGGCCGGG GTGCGTCAGA ATGTGATGGG CTCCAGCATT GATGGTCGCC 
3721 CCGTCCTGCC CGCAAACTCT ACTACCTTGA CCTACGAGAC CXnXSTCTGGA ACGCCGTTGG 
3781 AGACTGCAGC CTCCGCCX3CC GCTTCAGCCG CTGCAGCCAC CGCCCGCGGG ATTGTGACTG 
3841 ACTTTGCTTT CCTGAGCCCG CTTGCAAGCA GTGCAGCTTC CCGTTCATCC GCCXXSCXSATG 
3901 ACAAGTTGAC GGCTCTTTTG GCACAATTGG ATTCTTTGAC CCGGGAACTT AATGTCGTTT 
3961 CTCAGCAGCT GTTGGATCTG CGCCAGCAGG TTTCTGCCCT GAAGGCTTCC TCCXTCTCCCA 
4021 ATGCGGTTTA AAACATAAAT AAAAAACCAG ACTCTGTTTG GATTTGGATC AAGCAAGTGT 
4081 CTTGCTCTCT TTATTTAGGG GTTTTGCGCG CGCGGTAGGC CCGGGACCAG CGGTCTCGGT 
4141 CGTTGAGGGT CCTGTGTATT TTTTCCAGGA CGTGGTAAAG GTGACTCTGG ATGTTCAGAT 
4201 ACATGGGCAT AAGCCCGTCT CTGGGGTGGA GGTAGCACCA CTGCAGAGCT TCATGCTGCQ 
4261 GGGTGGTGTT GTAGATGATC CAGTCGTAGC AGGAGCGCTG GGCGTGGTGC CTAAAAATGT 
4321 CTTTCAGTAG CAAGCTGATT GCCAGGGGCA GGCCCTTGGT GTAAGTGTTT ACAAAGCGGT 
4381 TAAGCTGGGA TGGGTGCATA CGTGGGGATA TGAGATGCAT CTTGGACTGT ATTTTTAGGT 
4441 TGGCTATGTT CCCAGCCATA TCCCTCCGGG GATTCATGTT GTGCAGAACC ACCAGCACAG 
4501 TOTATCCGGT GCACTTGGGA AATTTGTCAT GTAGCTTAGA AGGAAATGCX3 TGGAAGAACT 
4561 TGGAGACGCC CTTOTGACCT CCAAGATTTT CCATGCATTC GTCCATAATG ATGGCAATGG 
4621 GCCCACGGGC GGCGGCCTGG GCGAAGATAT TTCTGGGATC ACTAACGTCA TAGTTGTGTT 
4681 CCAGGATGAG ATCGTCATAG GCCATTTTTA CAAAGCGCGG GCGGAGGGTC CCAGACTGOG 
4741 GTATAATGGT TCCATCCGGC CCAGGGGCGT AGTTACCCTC ACAGATTTGC ATT TCCCAC G 
4801 CTTTGAGTrC AGATGGGGGG ATCATGTCTA CCTGCGGGGC GATGAAGAAA AC0GTTTCC2G 
4861 GGGTAGGGGA GATCAGCTGG GAAGAAAGCA GGTTCCTGAG CAGCTGCGAC TTACCGCAGC 
4921 CGQTGGGCCC GTAAATCACA CCTATTACCG GGTGCAACTG GTAGTTAAGA GAGCTG CAGC 
4981 TGCCGTCATC CCTGAGCAGG GGGGCCACTT CGTTAAGCAT GTCCCTGACT CGCATGTTTT 
5041 CCCTGACCAA ATCCX3CCAGA AGGCGCTCGC CJGCCXZAGCGA T AGCAGT TCT TGCAAGQAAG 
5101 CAAAGTTTTT CAACGGTTTG AGACCGTCCG CCGTAGGCAT GCTTTTGAGC GTTTGACCAA 
5161 GCAGTTCCAG GCGGTCCCAC AGCTCGGTCA CCTGCTCTAC GGCATCTCQA TCCAGCATAT 
5221 CTCCTCGTTT CGCGGGTTGG GGCGGCTTTC GCTGTACGQC AGTAGTCGQT GCTCGTCCAG 
5281 ACGGGCCAGG GTCATGTCTT TCCACGGGCG CAGGGTCCTC GTCAGCGTAG TCTGGGTCAC 
5341 GGTOAAGGGG TGCGCTCCX3G GCTGCGCGCT GGCCAGGGTG CGCTTGAGGC TGGTCCTCCT 
5401 GGTGCTGAAG CGCTGCCGGT CrTCGCCCrG CGCGTCGGCC AGGTAGCATT TGACCATGGT 
5461 GTCATAGTCC AQCCCCTCCG CX5GCOTGGCC CTTGGCX3CGC AGCTTGCCCT TGQAGGAGGC 
5521 GCCGCACGAQ GGGCAGTGCA GACTTTTOAG GGCGTAGAGC TTGGGCGCGA GAAATACCX3A 
5581 TTCCGGGGAG TAGGCATCCG CGCCGCAGGC CCXX5CAGACG GTCTC GCATT CCACGAOCCA 
5641 GGTGAGCTCT GGCCGTTCGG GGTCAAAAAC CAGGTTTCCC CCATOCTTTT TGATGCGTTT 
5701 CTTACCTCTG GTTTCCATGA GCCGGTGTCC ACXJCTCGGTG ACX3AAAAGGC TGTCCGTOTC 
5761 CCCGTATACA GACTTGAGAG GCCTGTCCTC GAGCGGTQTT CCGCXX3TCCT CCTCGTATAG 
5821 AAACTCX3GAC CACTCTGAGA CAAAGGCTCG CGTCCAGGCC AGCACGAAGG AGGCTAAGTG 
5881 GGAGGGGTAG CGGTCGTTGT CCACTAGGGG GTCCACTCGC TCCAGGGTGT GAAGACACAT 
5941 GTCGCXrCTCT TCGGCATCAA GGAAGGTGAT TGGTTrGTAG GTGTAGGCCA CGTGACCGGG 
6001 TGTTCCTCAA GGGGGGCTAT AAAAGGGGGT GGGGGOGCGT TCGTCCTCAC TCTCTTCOGC 
6061 ATCX5CTGTCT GCGAGGGCCA GCTGTTGGGG TOAOTACTCC CTCTGAAAAG COGGCATOAC 
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6121 TTCTGCGCTA AGATTGTCAG TTTCCAAAAA CGAGGAGGAT TTGATATTCA CCTGG CCCGC 
6181 GGTGATGCCT TTGAGGGTGG CCGCATCCAT CTGGTCAGAA AAGACAATCT TTTTGTTGTC 
6241 AAGCTTGGTG GCAAACGACC CGTAGAGGGC GTTGGACAGC AACTTGGCGA TGGAGCGCAG 
6301 GGTTTGGTTT TTGTCGCGAT CGGCGCGCTC CTTGGCCGCG ATGTTTAGCT GCACX5TATTC 
6361 GCGCGCAACG CACCGCCATT CX3GGAAAGAC GGTGGTQCGC TCGTCGGGCA CCAGGTGCAC 
6421 GCGCCAACCG CGGTTGTGCA GGGTGACAAG GTCAACGCTG GTGGCTACCT CTCCGCGTAG 
6481 GCGCTCGTTG GTCCAGCAGA GGCGGCCGCC CTTGCGCX3AG CAGAATOGCG GTAGGGGGTC 
6541 TAGCTGCGTC TCGTCCGGGG GGTCTGCGTC CACGGTAAAG ACCCCX5GGCA GCAGGCGCGC 
6601 GTCX3AAGTAG TCTATCTTGC ATCCTTGCAA GTCTAGCGCC TGCTGCCATG CX5CGGGCGGC 
6661 AAGCGCGCGC TCGTATGGGT TGAGTGGGGG ACCCCATGGC ATGGGGTGGG TGAGCGCGGA 
6721 GGCGTACATG CCGCAAATGT CGTAAACGTA GAGGGGCTCT CTGAGTATTC CAAGATATGT 
6781 AGGGTAGCAT CTTCCACCGC GOATGCTGGC GCGCACGTAA TCGTATAGTT CGTGCGAGGG 
6841 AGCXaiGGAGQ TCGGGACCGA GGTTGCTACG GGCGGGCTGC TCTGCTCGGA AGACTATCTG 
6901 CCTGAAGATG GCATGTGAGT TGGATGATAT GGTTGGACGC TGGAAGACGT T GAAG CTGGC 
6961 GTCTGTGAGA CCTACCGCGT CACGCACGAA GGAGGCGTAG GAGTCGCGCA GCTTGTTGAC 
7021 CAGCTCGGCG GTGACCTGCA CGTCTAGGGC GCAGTAGTCC AGGGTTTCCT TGATGATQTC 
7081 ATACTTATCC TGTCCCTTTT TTTTCCACAG CTCGCGGTTG AGGACAAACT CTTCGCGGTC 
7141 TTTCCAGTAC TCTTGQATCG GAAACCCGTC GGCCTCCGAA CGGTAAGAGC CTAGCATGTA 
7201 GAACTGGTTG ACXK3CCTGGT AGGCGCAGCA TCCCTTTTCT ACGGGTAGCG CGT ATGCC TG 
7261 CGCX3GCCTTC CX3GAGCGAGG TGTGGGTGAG CGCAAAGGTG TCCCTGACCA TGACTTTOAG 
7321 GTACTGGTAT TTGAAGTCAG TGTCGTCGCA TCCGCCCTGC TCCCAGAGCA AAAAGTCOGT 
7381 GCGCTTTTTG GAACGCGGAT TTGGCAGGGC GAAGGTGACA TCGTTGAAGA GTATCTTTCC 
7441 CGCGCGAGGC ATAAAGTTGC GTGTGATGCG GAAGGGTCCC GGCACCTCGG AACXK5TTGTT 
7501 AATTACCTGG GCGGCGAGCA CGATCTCGTC AAAGCCGTTG ATGTTGTGGC CCACAATGTA 
7561 AAGTTCCAAG AAGCGCGGGA TGCCCTTGAT GGAAGGCAAT TTTTTAAGTT CCTCGTAGGT 
7621 GA6CTCTTCA GGGGAGCTGA GCCCGTGCTC TGAAAGGGCC CAGTCTGCAA GATGAGGGTT 
7681 GGAAGCGACG AATGAGCTCC ACAGGTCACG GGCCATTAGC ATTTGCAGGT GGTCGCGAAA 
7741 GGTCCTAAAC TGGCGACCTA TGGCCATTTT TTCTGGGGTG ATGCAGTAGA AGGTAAGCGG 
7801 GTCTTGTTCC CAGCGGTCCC ATCCAAGGTT CGCGGCTAGG TCTCGCGCGG CAGTCACTAG 
7861 AGGCTCATCT CCGCCGAACT TCATGACCAG CATGAAGGGC ACGAGCTGCT TCCCAAAGGC 
7921 CCCCATCCAA GTATAGGTCT CTACATCGTA GGTGACAAAG AGACGCTCGG TGCXyVGGATG 
7981 CGAGCCGATC GGGAAGAACT GQATCTCCCG CCACCAATTG GAGGAGTGGC TATTGATGTG 
8041 GTGAAAGTAG AAGTCCCTGC GACGGGCCX3A ACACTCGTGC TGGCTTTTGT AAAAACGTOC 
8101 GCAGTACTGG CAGOGGTGCA CGGGCTGTAC ATCCTGCACG AG GTTGA CCT GACGACCGCG 
8161 CACAAGGAAG CAGAGTGGGA ATTTGAOCCC CTCGCCTGGC GGGTTTGGCT GGTGGTCTTC 
8221 TACTTCGGCT GCTTGTCCTT GACCGTCTGG CTGCTCGAGG GGAGTTACGG TGGATCGGAC 
8281 CACCACGCCG CGCGAGCCCA AAGTCCAGAT GTCCGCGCGC GGCGGTCGGA GCTTGATGAC 
8341 AACATCXSCGC AGATGGGAGC TGTCCATGGT CTGGAGCTCC CGCGGCGTCA GGTCAQGCX3G 
8401 GAGCTCCTGC AGGTTTACCT CGCATAGACG GGTCAGGGCG CGGGCTAGAT CCAGGTGATA 
8461 CCTAATTTCC AGGGGCTOGT TGGTGGCGGC GTCGATGGCT TGCAAGAGGC CGCATCCCCG 
8521 CGGCX5CX3ACT ACGOTACCGC GCGGCX5GGCG GTGGGCCGCG GGGGTGTCCT TGGATGATGC 
8581 ATCTAAAAGC GGTCACGCGG GCGAGCCCCC GGAGGTAGGG GGGGCTCCGG ACCCX3CCX3GG 
8641 AGAGGGGGCA GGGGCACX3TC GGCGCCGCGC GCGGGCAGGA OCTOGTGCTO CGCGCOTAQO 
8701 TTGCTGGCGA ACGCGACGAC GCGGCGGTTG ATCTCCTGAA TCTGGCGCCT CTGCGTGAAG 
8761 ACGACX^GGCC CGGTCAGCTT GAGCCTGAAA GAGAGTTCGA CAGAATCAAT TTCGGTOTCG 
8821 TTGACGGCGG CCTCGCGCAA AATCTCCTGC ACGTCTCCTG AGTTGTCTTG ATAGGCGATC 
8881 TCGQCCATGA ACTOCTCGAT CTCTTCCTCC TGGAGATCTC CGCGTCCGGC TCGCTCCACG 
8941 GTOGC3GGCGA GQTCGTTGGA AATGCGGGCC ATGAGCTGCG AQAAGGCGTT GAGGCCTCCC 
9001 TCGTTCCAGA CGCGGCTOTA GACCACGCCC CCTTCGGCAT CX3CGGGCXKX3 CATGACCACC 
9061 TGCGCX3AGAT TGAGCTCCAC GTOCCGGGCG AAGACGGCX3T AGTTTCGCAG GCGCTGAAAG 
9121 AGGTAQTTOA GGGTGGTGGC GGTGTGTTCT GCCACOAAGA AGTACATAAC CCAGCGTCGC 
9181 AACGTGGATT CGTTGATATC CCCCAAGGCC TCAAGGCGCT CCATGQCCTC GTAOAAOTCC 
9241 ACGGCGAAGT TGAAAAACTC GGAGTTGCGC GCCGACACGG TTAACTCCTC CTCCAOAAGA 
9301 CGGATGAGCT CGGCOACAGT GTCGCX3CACC TCGCGCTCAA AGGCTACAGG GGCCTCTTCT 
9361 TCTTCTTCAA TCTCCTCTTC CATAAGGGCC TCCCCTTCTT CTTCTTCTGG CGGCGGTGGG 
9421 GGAGGGGOGA CACGGCGGCG ACGACGGCGC ACXXSGOAQGC GGTCG ACAAA GCGCTCOATC 
9481 ATCTCCCCGC GGCX3ACGGC0 CATGGTCTCO GTGACGGOGC GGCOGTTCTC GOOGGGOOGC 
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9541 AGTTGGAAGA CGCCGCCCGT CATGTCCCGG TTATGGGTTG GCGGGGGGCT GCCATGCGGC 
9601 AGGGATACGG CGCTAACGAT GCATCTCAAC AATTGTTGTG TAGGTACTCC GCCGCCGAGG 
9661 GACCTGAGCG AGTCCGCATC GACCGGATCG GAAAACCTCT CGAGAAAGGC GTCTAACCAG 
9721 TCACAGTCGC AAGGTAGGCT GAGCACCGTG GCGGGCX3GCA GCGGGCGGCG GTCGGGGTTG 
9781 TTTCTCGCGG AGGTGCTGCT GATGATGTAA TTAAAOTAGG CGGTCTTGAG ACXK3CGGATG 
9841 GTCGACA6AA GCACCATGTC CTTGGGTCCG GCCTGCTOAA TGCXSCAGGCG GTOGGCCATG 
9901 CCCCAGGCTT CGTTTTGACA TCGGCGCAGG TCTTTGTAGT AGTCTTGCAT GAGCCTTTCT 
9961 ACCGGCACTT CTTCTTCTCC TTCCTCTTGT CCTGCATCTC TTGCATCTAT CGCTGCGQCG 
10021 GCGGCGGAGT TTGGCCGTAG GTGGCGCCCT CTTCCTCCCA TGCGTGTGAC CCCGAAGCCC 
10081 CTCATCGGCT GAAGCAGGGC TAGGTOGGCG ACAACGCGCT CGGCTAATAT GGCCTGCTGC 
10141 ACCTCCOTGA GGGTAGACTG GAAGTCATCC ATGTCCACAA AGCGGTGGTA TGCGCCCGTG 
10201 TTGATGGTGT AAGTOCAGTT GGCCATAACG GACCAGTTAA CX3GTCTGGTG ACCCGGCTGC 
10261 GAGAGCTCGG TGTACCTGAG ACGCXSAGTAA GCCCTCGAOT CAAATACGTA GTCGTTGCAA 
10321 GTCCGCACCA QGTACTGGTA TCCCACCAAA AAGTGCGGCG GCGGCTGGCG GTAGAGGGGC 
103B1 CAGCGTAGGG TGGCCGGGGC TCCGGGGGCG AGATCTTCCA ACATAAGGCG ATGATATCCG 
10441 TAGATGTACC TGGACATCCA GGTGATGCCG GCGGCX3GTGG TGGAGGCGCG CXK3AAAGTCG 
lOSOl CGGACGCGGT TCCAGATGTT GCGCAGCGGC AAAAAGTGCT CCATGGTCGG GACGCTCTGG 
10561 CCGGTCAGGC GCGCGCAATC 6TTGACGCTC TAGACCGTGC AAAAGGAGAG CCTGTAAGCG 
10621 GGCACTCTTC CGTOOTCTGG TGGATAAATT CGCAAGGGTA TCATGGCGGA CGACCGGGGT 
10681 TCGAGCCCCG TATCOGGCCG TCCGCCGTGA TCCATGCGGT TACOGCCCGC GTGTCGAACC 
10741 CAGGTGTGCG ACGTCAGACA ACGGGGGAGT GCTCCTTTTG GCTTCCTTCC AGGCGCGGCG 
10801 GCTGCTGCGC TAGCTTTTTT GGCCACTGGC CGCGCGCAGC OTAAGCGGTT AGGCTGGAAA 
10861 GCGAAAGCAT TAAGTGGCTC GCTCCCTGTA GCCGGAGGGT TATTTTCCAA GGGTTGAGTC 
10921 GCGGGACCCC CGGTTCGAGT CTCGGACCGG CCGGACTGCG GCGAACGGGG G TTTGCCT CC 
10981 CCGTCATGCA AGACCCCGCT TGCAAATTCC TCCGGAAACA GGGACGAGCC CCTTnTTGC 
11041 TTTTCCCAGA TGCATCCGGT GCTOCGGCAG ATGCGCCCCC CTCCTCAGCA GCGGCAAGAG 
11101 CAAGAGCAGC GGCAOACATG CAGGGCACCC TCCCCTCCTC CTACCGCGTC AGGAGGGGCG 
11161 ACATCCGCGG TTGACGCGGC AGCAGATGGT GATTACGAAC CCCOQOGGCG CCGGGCTCGG 
11221 CACTACCTGG ACTTGGAGGA GGGCGAGGGC CTGG06CGGC TAGGAGCGCC CTCTCCTGAG 
11281 CGGTACCCAA GGGTGCAGCT GAAGCGTGAT ACGCGTGAGG CGTACGTGCC GCGGCAGAAC 
11341 CTGTTTCGCG ACCGCGAGGG AGAGGAGCCC GAGGAGATGC GGGATCGAAA GTTCCACGCA 
11401 GGGCGCXSAGC TGCGGCATGQ CCTGAATCGC GAGCGGTTGC TGCGCGAGGA GGACTTTOAG 
11461 CCCGACGCGC GAACCGGGAT TAGTCCCGCG COCGCACACG TGGCGGCCGC CGACCTGGTA 
11521 ACCGCATACG AGCAGACGGT GAACCAGGAG ATTAACTTTC AAAAAAGCTT TAACAACCAC 
11581 GTGCGTACGC TTGTGGCGCG CGAGGAGGTG GCTATAGQAC TGATGCATCT GTGGGACTTT 
11641 GTAAGCGCGC TGGAGCAAAA CCCAAATAGC AAGCCGCTCA TGGCQCAOCT GTTCCTTATA 
11701 GTGCAGCACA GCAGGGACAA CGAGGCATTC AGGGATGCGC TGCTAAACAT AGTAGAGCCC 
11761 GAGGGCCGCT GGCTGCTCGA TTTGATAAAC ATCCTGCAOA GCATAGTGGT GCAGGAGCGC 
11821 AGCTXGAGCC TGGCTCACAA GGTGGCCGCC ATCAACTATT CCATGCTTAG CCTGGQCAAG 
11881 TTTTAOGCCC GCAAGATATA CCATACCCCT TAOGTTCCCA TAQACAAGGA GGTAAAGATC 
11941 GAGGGGTTCT ACATGCGCAT GGCGCTGAAG GTGCTTACCT TGAGCGACGA CCTGGGOGTT 
12001 TATCGCAACG AGCGCATCCA CAAGGCCOTO AGCGTGAGCC GGCGGCGCGA GCTCAGCGAC 
12061 CGCGAGCTGA TGCACAGCCT GCAAAGGGCC CTGOCTGGCA CGG0CAGCG6 COATAGAQAG 
12121 GCCGAGTCCT ACTTTGACGC GGGCGCTGAC CTGCGCTGGG CCCCAAGCCO AOOCQOOTPG 
12181 GAGGCAGCTG GGGCCGGACC TGGGCTGGCG GTGGCACCOG CGCGCGCTGG CAACOTCG6C 
12241 GGCGXGQAaO AATATGACGA GGACGATGAG TACGAGCCAG AGQACGGCGA GTACTAAOCG 
12301 GTOATGTTTC TCATCAGATG ATGCAAGACG CAACGGACCC GGCGGTGCGG GCGGCGCTGC 
12361 AGAGCCAQCC GTCCGGCCTT AACTCCACGG ACGACTOGCG CCAGOTCATG GACOGCATCA 
12421 TGTCGCTQAC TGCQCGCAAT CCTGACGCGT TCCGGCAGCA GCCGCAGGCC AACOGOCTCT 
12481 CCGCAATTCT GQAAGOGGTG OTCCCGGCGC GOGCAAACCC CAC6CACGAO AAGOTGCTGO 
12541 CGATCGTAAA CGCGCTGGCX: GAAAACAGGG CCATCCGGCC CGA06AGGCC OOCCTOGTCT 
12601 ACGACGCGCT GCTTCAGCGC GTGGCTCGTr ACAACAGCQO CAAOGTGC»a ACCAACCTTO 
12661 ACCGGCTCGT GGGGGATGTC CGCGAGGCCXS TGGCGCAGOG TGAGCGCGCX3 CAGCSVGCAGQ 
12721 GCAACCTOGG CTCCATGGTT GCACTAAACG CCTTCCTGAG TACACAGCCC GCCAACOTGC 
12781 CGCGGGGACA GGAGGACTAC ACCAACTTTG TGAGCGCACT GCGGCTAATG fTGACTGAGA 
12841 CACX»CAAAO TGAGGTGTAC CAQTCTGGGC CAGACTATTT TTTCCAGACC 
12901 GOrrGCAGAC CGTAAACCTG AGCCAGGCTT TCAAAAACTT GCAGGGGCTG TOGGGGGTCC 
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12961 GGGCTCCCAC AGGCX5ACCGC GCGACCGTGT CTAGCTTGCT GACGCCCAAC TCX3CGCCTGT 
13021 TGCTGCTGCT AATAGCGCCC TTCACGGACA GTGGCAGCGT GTCCCGGGAC ACATACCTAG 
13081 GTCACTTGCT GACACTGTAC CGCGAGGCCA TAGGTCAGGC GGATGTGOAC GAGCATACTT 
13141 TCCAGGAGAT TACAAGTGTC AGCCGCGCGC TGGGGCAGGA GGACACGGGC AGCCTGGAGG 
13201 CAACCCTAAA CTACCTGCTG ACCAACCGGC GGCAGAAGAT CCCCTCGTTG CACAGTTTAA 
13261 ACAGCGAGGA GGAGCGCATT TTGCGCTACG TGCAGCAGAG CGTGAGCCTT AACCTGATGC 
13321 GCGACX3GGGT AACGCCCAGC GTGGCGCTGG ACATGACCGC GCGCAACATG GAACCGGGCA 
13381 TGTATGCXrrC AAACCGGCCG TTTATCAACC GCCTAATGGA CTACTTGCAT CGCX3CGGCCG 
13441 CCGTGAACCC CX3AGTATTTC ACCAATGCCA TCTTGAACCC GCACTGGCTA CCGCCCCCTG 
13501 GTTTCTACAC CGGGGGATTC GAGGTGCCCX5 AGGGTAACGA TGG ATTC CTC TGGGACGACA 
13561 TAGACGACAG CGTGTTTTCC CCGCAACCX3C AGACCCTGCT AGAGTTGCAA CAGCGCGAGC 
13621 AGGCAGAGGC GGCGCTGCGA AAGGAAAGCT TCCGCAGGCC AAGCAGCTTG TCCGATCTAG 
13681 GCGCTGCGGC CCCGCGGTCA GATGCTAGTA GCCCATTTCC AAGCTTGATA GGGTCTCTTA 
13741 CCAGCACTCG CACCACCCGC CCGCGCCTGC TGGGCGAGGA GGAGTACCTA AACAACTCGC 
13801 TCCTGCAGCC GCAGCOCGAA AAAAACCTGC CTCCGGCATT TCCCAACAAC GGGATAGAGA 
13861 GCCTAGTGGA CAAGATCAGT AGATGGAAGA CGTACGCGCA GGAGCACAGG GACGTGCCAG 
13921 GCCCGCGCCC GCCCACCCGT CGTCAAAGGC ACGACCGTCA GCGGGGTCTG GTGT GGGAG G 
13981 ACX5ATGACTC GGCAGACGAC AGCAGCGTCC TGGATTTGGG AGGGAGTGGC AACCCGTTTG 
14041 CGCACCTTCX3 CCCCAGGCTG GGGAGAATGT TTTAAAAAAA AAAAAGCATG ATGCAAAATA 
14101 AAAAACTCAC CAAGGCCATG GCACCGAGCG TTGGTTTTCT TGTATTCCCC TTAGTATGCG 
14161 GCGCGCXSGCG ATGTATGAGG AAGGTCCTCC TCCCTCCTAC GAGAGTGTGG TGAGCGCGGC 
14221 GCCAGTGGCG GCGGCGCTGG GTTCTCCCTT CGATGCTCCC CTGGACCCGC CGTTTGTGCC 
14281 TCCGCGGTAC CTGCGGCCTA CCX3GGGGGAG AAACAGCATC CGTTACTCTG AGTTGGCACC 
14341 CCTATTCGAC ACCACCCGTG TGTACCTGGT GGACAACAAG TCAACGGATG TGGCATCCCT 
14401 GAACTACCAG AACGACCACA GCAACTTTCT GACCACGGTC ATTCAAAACA ATGACTACAG 
14461 CCCGGGGGAG GCAAGCACAC AGACCATCAA TCTTGACGAC CGGTCGCACT GGGGCGGCGA 
14521 CCTGAAAACC ATCCTGCATA CCAACATGCC AAATGTGAAC GAGTTCATGT TTACCAATAA 
14581 GTTTAAGGCG CGGGTGATGG TGTCGCX3CTT GCCTACTAAG GACAATCAGG TGGAGCTGAA 
14641 ATACGAGTGG GTGGAGTTCA CX3CTGCCCGA GGGCAACTAC TCCGAGACCA TGACCATAGA 
14701 CCTTATGAAC AACGCGATCG TGGAGCACTA CTTGAAAGTG GGCAGACAGA ACGGGGTTCT 
14761 GGAAAGCGAC ATCGGGGTAA AGTTTGACAC CCGCAACTTC AGACTGGGGT TTGACCCCGT 
14821 CACTGGTCTT GTCATGCCTG GGGTATATAC AAACGAAGCC TTCCATCCAG ACATCATTTT 
14881 GCTGCCAGGA TGCGGGGTGG ACTTCACCCA CAGCCGCCTG AGCAACTTGT TGGGCATCCG 
14941 CAAGCGGCAA CCCTTCCAGG AGGGCTTTAG GATCACCTAC GATGATCTGG AGGGTGGTAA 
15001 CATTCCCGCA CTGTTGGATG TGGACGCCTA CCAGGCGAGC TTGAAAGATG ACACCGAACA 
15061 GGGCGGGGGT GGCGCAGGCG GCAGCAACAG CAGTGGCAGC GGCGCGGAAG AGAACTCCAA 
15121 CGCGGCAGCC GCGGCAATGC AGCCGGTGGA GGACATGAAC GATCATGCCA TTCGCGGCGA 
15181 CACCTTTGCC ACACGGGCTG AGGAGAAGCG CGCTGAGGCC GAAGCAGCX3G CCGAAGCTGC 
15241 CGCCCCCGCT GCX5CAACCCG AGGTCGAGAA GCCTCAGAAG AAACCGGTGA TCAAACCCCT 
15301 GACAGAGGAC AGCAAGAAAC GCAGTTACAA CCTAATAAGC AATGACAGCA CCTTCACCCA 
15361 GTACCGCAGC TGGTACCTTG CATACAACTA CGGCX5ACCCT CAGACCX5GAA TCCX3CTCATG 
15421 GACCCTGCTT TGCACTCCTG ACGTAACCTG CX^GCTCGGAG CAGGTCTACT GGTCGTTGCC 
15481 AGACATGATG CAAGACCCCG TGACCTTCCG CTCCACGCGC CAGATCAGCA ACTTTCCX3GT 
15541 GQTGGGCGCC GAGCTGTTGC CCGTGCACTC CAAGAGCTTC TACAACGACC AGGCCXmriA 
15601 CTCCCAACTC ATCCGCCAGT TTACCTCTCT GACCCACX3TG TTCAATCGCT TTCCCGAGAA 
15661 CCAGATTTTG GCGCGCCCGC CAGCCCCCAC CATCACCACC GTCAGTGAAA ACGTTCCTGC 
15721 TCTCACAGAT CACXX3GACX3C TACCGCTOCG CAACAGCATC GGAGGAOTCC AGCXSAGTGAC 
15781 CATTACTGAC GCCAGACX3CC GCACCTGOCC CTACGTTTAC AAGGCCCTGG GCATAGTCTC 
15841 GCCX5CGCGTC CTATCXJAGCC GCACTTTTTG AGCAAGCATG TCCATCCTTA TATOSCCCAG 
15901 CAATAACACA GGCTGGGGCC TGCGCTTCCC AAGCAAGATG TTTGGCGGGG CCAAGAAGCG 
15961 CTCCGACCAA CACCXyiGTGC GCGTGCGCX3G GCACTACCGC GCGCCCTGGG GCGCGCACAA 
16021 ACGCGGCCGC ACTGGGOGCA CCACCGTCGA TGACGCCATC GACGCGGTGG TGGAGGAGGC 
16081 GCX3CAACTAC ACGCCCACGC CGCCACCAGT GTCCACAGTG GACX3CX3GCCA TTCAGACCGT 
16141 GGTGCGCGGA GCXTOGGCGCT ATGCTAAAAT GAAGAGACGG CGGAGGOGCG TAGCACGTCG 
16201 CCACXXK:CX5C OGACCCGGCA CTGCCGCCCA ACGCGCGGCG GCQGCCCTGC TTAACCGCGC 
16261 hCGTCGCACC GGCCGACGGG CGGCCATGCO GGCCGCT06A AGGGTGGCXXl CGGGTATTGT 
16321 CACTGTGCCC CCCAGGTCCA GGCGACGAGC GGCCGCCOCA GCAGCCXSCGG CCATTAOTGC 
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6381 TATGACTCAG GGTCGCAGGG GCAACGTGTA TTGGGTGCGC GACTCGGTTA GCGGCCTGCG 
.6441 CGTGCCCGTG CGCACCCGCC CCCCGCGCAA CTAGATTGCA AGAAAAAACT ACTTAGACTC 
.6501 GTACTGTTGT ATGTATCCAG CGGCGGCGGC GCGCAACGAA GCTATGTCCA AGCGCAAAAT 
6561 CAAAGAAGAG ATGCTCCAGG TCATCGCGCC GGA6ATCTAT GGCCCCCCGA A6AAGGAAGA 
6621 GCAGGATTAC AAGCCCCGAA AGCTAAAGCG GGTCAAAAAG AAAAAGAAAG ATGATGATGA 
6681 TCAACTTGAC GACOAGGTGQ AACT6CTGCA CGCTACCGCG CCCAGGCGAC GGGTACAGTG 
6741 GAAAGGTCGA CGCGTAAAAC GTGTTTTGCG ACCCGGCACC ACCGTAGTCT TTACGCCCGG 
6801 TGAGCGCTCC ACCCGCACCT ACAAGCGCGT GTATGATGAG GTGTACGGCG ACGAGGACCT 
6861 GCTTGAGCAG GCCAACGAGC GCCTCGGGGA GTTTGCCTAC GGAAAGCGGC ATAAGGACAT 
.6921 GCTGGCGTTG CCGCTGGACG AGGGCAACCC AACACCTAGC CTAAAGCCCG TAACACTGCA 
.6981 GCAGGTCCTG CCCX3CGCTTG CACCGTCCGA AGAAAAGCGC GGCCTAAAGC GCQAGTCTGG 
.7041 TGACTTGGCA CCCACCOTGC AGCTGATGGT ACCCAAGCGC CAGCGACTGG AAGATGTCTT 
.7101 GGAAAAAATG ACCGTGGAAC CTGGGCTGGA GCCCGAGGTC CGCGTGCGGC CAATCAAGCA 
7161 GGTGGCGCCG GGACTGGGCG TGCAGACCGT GGACGTTCAG ATACCCACTA CCAGTAGCAC 
7221 CAGTATTGCC ACCGCCACAG AGGGCATGGA GACACAAACG TCCCCGGTTG CCTCAGCGGT 
.7281 GGCGGATGCC GCGGTGCAGG CGGTCGCTGC GGCCGCGTCC AAGACCTCTA CGGAGGTGCA 
.7341 AACGGACCCG TGGATGTTTC GCGTTTCAGC CCCCCGGCGC CCGCGCGGTT CGAGGAAGTA 
7401 CGGCGCCX5CC AGCGCGCTAC TGCCCGAATA TGCCCTACAT CCTTCCATTG CGCCTACCCC 
.7461 CGGCTATCGT GGCTACACCT ACCGCCCCAG AAGACX3AGCA ACTACCCGAC GCCOAACCAC 
7521 CACTGGAACC CGCCGCCGCC GTCGCCGTCG CCAGCCCGTG CTGGCCCCGA TTTCXXaPGCG 
7581 CAGGGTGGCT CGCGAAGGAG GCAGGACCCT GGTGCTGCCA ACAGCGCGCT ACCACCCCAG 
.7641 CATCGTTTAA AAGCCGGTCT TTGTGGTTCT TGCAGATATG GCCCTCACCT GCCGCCTCCX3 
7701 TTTCCCGGTG CCGGGATTCC GAGGAAGAAT GCACCGTAGG AGGGGCATGG CCGGCCACGG 
.7761 CCTGACGGGC GGCATGCGTC GTGCGCACCA CCGGCGGCGG CGCGCGTCGC ACCGTCGCAT 
.7821 GCGCGGCGGT ATCCTGCCCC TCCTTATTCC ACTGATCGCC GCGGCGATTG GCGCCGTGCC 
.7881 CX5GAATTGCA TCCGTGGCCT TGCAGGCGCA GAGACACTGA TTAAAAACAA GTTGCATGTG 
.7941 GAAAAATCAA AATAAAAAGT CTGGACTCTC ACGCTCGCTT GGTCCTGTAA CTATTTTGTA 
8001 GAATGGAAGA CATCAACTTT GCGTCTCTGG CCCCGCGACA CGGCTCGCGC CCGTTCATGG 
.8061 GAAACTGGCA AGATATCGGC ACCAGCAATA TGAGCX3GTGG CGCCTTCAGC TGGGGCTCGC 
.8121 TGTGGAGCGG CATTAAAAAT TTCGGTTCCA CCGTTAAGAA CTATGGCAGC AAGGCCTGGA 
.8181 ACAGCAGCAC AGGCCAGATG CTGAGGGATA AGTTGAAAGA GCAAAATTTC CAACAAAAGG 
.8241 TGGTAGATGG CCTGGCCTCT GGCATTAGCG GGGTGGTGGA CCTGGCCAAC CAGGCAGTGC 
.8301 AAAATAAGAT TAACAGTAAG CTTGATCCCC GCCCTCCCGT AGAGGAGCCT CCACCGGCCG 
.8361 TGGAGACAGT GTCTCCAGAG GGGCGTGGCG AAAAGCGTCC GCGCCCCGAC AGGGAAGAAA 
.8421 CTCTGGTGAC GCAAATAGAC GAGCCTCCCT CGTACGAGGA GGCACTAAAG CAAGGCCTGC 
.8481 CCACCACCCG TCCCATCGCG CCCATGGCTA CCGGAGTGCT GGGCCAGCAC ACACCOOTAA 
.8541 CGCTGGACCT GCCTCCCCCC GCCGACACCC AGCAGAAACC TGTGCTGCCA GGCCCX3ACCG 
.8601 CCGTrGTTGT AACCCGTCCT AGCCGCGCGT CCCTGCGCCG CGCCGCCAGC GGTCCGCGAT 
.8661 CGTTGCXK5CC CGTAGCCAGT GGCAACTGGC AAAGCACACT GAACAGCATC GTGGGTCTGG 
.8721 GGGTGCAATC CCTGAAGCGC CGACGATGCT TCTGAATAGC TAACGTGTCG TATGTGTGTC 
.8781 ATGTATGCGT CCATGTCGCC GCCAGAGGAG CTGCTGAGCC GCCX5CGCGCC CGCTTTCCAA 
.8841 GATGQCTACC CCTTCGATGA TGCCX5CAGTG GTCTTACATG CACATCTCGG GCCAGGACGC 
.8901 CTCGGAGTAC CTGAGCCCCG GGCTGGTGCA GTTTGCCCGC OCXJICCGAGA CGTACTTCAG 
.8961 CCTGAATAAC AAGTTTAGAA ACCCCACGGT GGOGCCTAOG CACX3ACGTGA CCACAGACCG 
.9021 GTCCCAGCGT TTGACGCTGC GGTTCATCCC T6TGGACCGT GAQGATACT6 CGTACTCGTA 
.9081 CAAGGCGCX3G TTCACCCTAG CTGTGGGTGA TAACOGTGTO CTGGACATGG CTTCCACGTA 
.9141 CTTTGACATC CGCGGCGTGC TGGACAGGGG CCCTACTTTT AAGCCCTACT CTGQCACTGC 
.9201 CTACAACGCC CTGGCTCCCA AGGGTGCCCC AAATCCTTGC GAATGGGATG AAQCTGCTAC 
.9261 TGCTCTTGAA ATAAACCTAG AAGAAGAGGA CGATGACAAC GAAGACGAAG TAGACGAQCA 
.9321 AGCTGAGCAG CAAAAAACTC ACGTATTTGQ GCAGGCGCCT TATTCTGGTA TAAATATTAC 
L9381 AAAGGAGGGT ATTCAAATAG GTGTCGAAGG TCAAACACCT AAATATGCCG ATAAAACATT 
L9441 TCAACCTGAA CCTCAAATAG GAGAATCTCA GTGGTAC6AA ACTGAAATTA ATCATGCAGC 
L9501 TCGGAGAOTC CTTAAAAAGA CTACCCCAAT GAAACCATGT TACGGTTCAT ATGCAAAACC 
L9561 CACAAATGAA AATGGAGGGC AAGGCATTCT TGTAAAGCAA CAAAATGGAA AGCTAlQAAAG 
L9621 TCAAGTGGAA ATGCAATTTT TCTCAACTAC TGAGGCGACC GCAGGCAATG GTGATAACTT 
L9681 GACTCCTAAA GTGGTATTGT ACAGTGAAGA TGTAGATATA GAAACCCCAG ACACTCATAT 
L9741 TTCTTACATG CCCACTATTA AOGAAGGTAA CTCACGAOAA CTAATGGGCC AACAATCTAT 
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19801 GCCCAACAGG CCTAATTACA TTGCTTTTAG GGACAATTTT ATTGGTCTAA TGTATTACAA 
19861 CAGCACGGGT AATATGGGTG TTCTGGCGGG CCAAGCATCG CAGTTGAATG CTGTTGTAGA 
19921 TTTOCAAGAC AOAAACACAG AGCTTTCATA CCAGCTTTTG CTTGATTCCA TTGGTGATAG 
19981 AACCAGGTAC TTTTCTATGT GGAATCAGGC TGTTGACAGC TATGATCCAG ATGTTAGAAT 
20041 TATTGAAAAT CATGGAACTG AAGATGAACT TCCAAATTAC TGCTTTCCAC TGGQAGGTGT 
20101 GATTAATACA GAGACTCTTA CCAAGGTAAA ACCTAAAACA GGTCAGGAAA ATGGATGGGA 
20161 AAAAGATGCT ACAGAATTTT CAGATAAAAA TGAAATAAGA GTTGGAAATA ATTTTGCCAT 
20221 GGAAATCAAT CTAAATGCCA ACCTGTGGAG AAATTTCCTG TACTCCAACA TAGCGCTGTA 
20281 TTTGCCCGAC AAGCTAAAGT ACAGTCCTTC CAACGTAAAA ATTTCTGATA ACCCAAACAC 
20341 CTACX3ACTAC ATGAACAAGC GAGTGGTGGC TCCCGGGTTA GTGGACTGCT ACATTAACCT 
20401 TGGAGCACGC TGGTCCCTTG ACTATATGGA CAACGTCAAC CCATTTAACC ACCACCGCAA 
20461 TGCTGGCCTG CGCTACCGCT CAATGTTGCT GGGCAATGGT CGCTATGTGC CCTTCCACAT 
20521 CCAGGTGCCT CAGAAGTTCT TTGCCATTAA AAACCTCCTT CTCCTGCCGG GCTCATACAC 
20581 CTACGAGTGG AACTTCAGGA AGGATGTTAA CATGGTTCTG CAGAGCTCCC TAGGAAATGA 
20641 CCTAAGGGTT GACGGAGCCA GCATTAAGTT TGATAGCATT TGCCTTTACG CCACCTTCTT 
20701 CCCCATGGCC CACAACACCG CCTCCACX3CT TGAGGCCATG CTTAGAAACX3 ACACCAACGA 
20761 CCAGTCCTTT AACGACTATC TCTCCGCCGC CAACATGCTC TACCCTATAC CCGCCAACGC 
20821 TACCAACGTG CCCATATCCA TCCCCTCCCG CAACTGGGCG GCTTTCCX3CG GCTGGGCCTT 
20881 CACGCGCCTT AAGACTAAGG AAACCCCATC ACTGGGCTCG GGCTACGACC CTTATTACAC 
20941 CTACTCTGGC TCTATACCCT ACCTAGATGG AACCTTTTAC CTCAACCACA CCTTTA3U3AA 
21001 GGTGGCCATT ACCTTTGACT CTTCTGTCAG CTGGCCTGGC AATGACCGCC TGCTTACCCC 
21061 CAACGAGTTT GAAATTAAGC GCTCAGTTGA CGGGGAGGGT TACAACX3TTG CCCAGTGTAA 
21121 CATGACCAAA GACTGGTTCC TGGTACAAAT GCTAGCTAAC TACAACATTG GCTACCAGGG 
21181 CTTCTATATC CCAGAGAGCT ACAAGGACCG CATGTACTCC TTCTTTAGAA ACTTCCAGCC 
21241 CATGAGCCGT CAGGTGGTGG ATGATACTAA ATACAAGGAC TACCAACAGG TGGGCATCCT 
21301 ACACCAACAC AACAACTCTG GATTTGTTGG CTACCTTGCC CCCACCATGC GCGAAGGACA 
21361 GGCCTACCCT GCTAACTTCC CCTATCCGCT TATAGGCAAG ACCGCAGTTG ACAGCATTAC 
21421 CCAGAAAAAG TTTCTTTGCG ATCGCACCCT TT66CGCATC CCATTCTCCA GTAACTTTAT 
21481 GTCCATGGGC GCACTCACAG ACCTGGGCCA AAACXTTTCTC TACGCCAACT CCGCCXACX5C 
21541 GCTAGACATG ACTTTTGAGG TGGATCCCAT GGACGAGCCC ACCCTTCTTT ATGTTTTGTT 
21601 TGAAGTCTTT GACGTGGTCC GTGTGCACCG GCCGCACCGC GGCGTCATCG AAACCGTGTA 
21661 CCTGCGCACG CCCTTCTCGG CCGGCAACGC CACAACATAA AGAAGCAAGC AACATCAACA 
21721 ACAGCTGCCG CCATGGGCTC CAGTGAGCAG GAACTGAAAG CCATTGTCAA AGATCTTGGT 
21781 TGTGGGCCAT ATTTTTTGGG CACCTATGAC AAGCGCTTTC CAGGCTTTGT TTCTCCACAC 
21841 AAGCTCGCCT GCX5CCATAGT CAATACGGCC GGTCGCGAGA CTGGGGGCGT ACACTGGATG 
21901 GCCTTTGCCT GGAACCCGCA CTCAAAAACA TGCTACCTCT TTGAGCCCTT TGGCTTTTCT 
21961 GACCAGCGAC TCAAGCAGGT TTACCAGTTT GA6TACGAGT CACTCCT6CG (XGTAGCGCC 
22021 ATTGCTTCTT CCCCCGACCG CTGTATAACG CTGGAAAAGT CCACCCAAAG CGTACAGGGG 
22081 CCCAACTCGG CCGCCTGTGG ACTATTCTGC TGCATGTTTC TCCACGCCTT TGCCAACTGG 
22141 CCCCAAACTC CCATGGATCA CAACCCCACC ATGAACCTTA TTACCGGGGT ACCCAACTCC 
22201 ATCCTCAACA GTCCCCAGGT ACAGCCCACC CTGCGTCGCA ACCAGGAACA GCTCTACAGC 
22261 TTCCTGGAGC GCCACTCGCC CTACTTCCGC AGCCACAGTG CGCAGATTAG GAGCGCCACT 
22321 TCrTTTTGTC ACTTGAAAAA CATGTAAAAA TAAT6TACTA GA6AGACTTT CAATAAAGOC 
22381 AAATGCTTTT ATTTGTACAC TCTOGGGTGA TTATTTACCC CCACCCTTGC CX3TCT0CQCC 
22441 GTTTAAAAAT CAAAGOOGTT CTGCCGCGCA TCGCTATOCG CCACTGQCAG 6GACACGTTG 
22501 CGATACTCGT GTTTAGTGCT CCACTTAAAC TCAGGCACAA CCATCCGCGG CAGCTCGGTG 
22561 AAGTTTTCAC TCCACAGGCT GCGCACCATC ACCAACGCGT TTAGCAGGTC GGGCGCCGAT 
22621 ATCTTGAAGT CX3CAGTTGGG GCCTCCGCCC TGCGCGCGCG AGTTGCGATA CACAGGGTTG 
22681 CAGCACTGGA ACACTATCAG CGCCGGGTGG TGCACGCTGG CCAGCACGCT CTTGTCGGAG 
22741 ATCAOATCCG CGTCCAGGTC CTCCGCGTTG CTCAGGGCGA ACGGAGTCAA CTTTGQTAGC 
22801 TOCCTTCCCA AAMGGGCGC GTGCCCAGGC TTTGAGTXGC ACTCGCACCG TAOTGGCATC 
22861 AAAAGGTQAC CGTGCCOGGT CTGGGCGTTA GGATACAGCG CCTGCATAAA AGGCTTGATC 
22921 TGCTTAAAAG CCACCTOAGC CTTTGCGCCT TCAOAQAAGA ACATGCCGCA AGACTTGCCG 
22981 GAAAACTGAT TOOCCOGACA GGCCGCGTCG TOCACOCAGC ACCTTOCXTrC GGTGTTGOAG 
23041 ATCTCCACCA CATTTCGGCC CCACCGGTTC TTCAOQATCT TGGCCTTGCT AGACTGCTCC 
23101 TTCAGCGCGC GCTGCCCGTT TTCGCTCGTC ACATCCATTT CAATCACGTG CTCCTTATTT 
23161 ATCATAATGC TTCCGTGTAG ACACTTAAGC TCGCCTTCGA TCTCAGOGCA OCGOTGCAGC 
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23221 CACAACGCGC AGCCCGTGGG CTCGTGATGC TTGTAGGTCA CCTCTGCAAA CGACTGCAGG 
23281 TACGCCTGCA GGAATCGCCC CATCATCGTC ACAAAGGTCT TGTTGCTGGT GAAGGTCAGC 
23341 TGCAACCCGC GGTGCTCCTC GTTCAGCCAG GTCTTGCATA CXSGCCXJCXIAG AGCTTCCACT 
23401 TGGTCAGGCA GTAQTTTGAA GTTCGCCTTT AGATCGTTAT CCACGTGGTA CTTOTCCATC 
23461 AGCGCGCGCG CAGCCTCCAT GCCCTTCTCC CACGCAGACA OGATCGGCAC ACTCAGOGGG 
23521 TTCATCACCG TAATTTCACT TTCCX5CTTCG CTGGGCTCTT CCTCTTCCTC TTGCGTCCGC 
23581 ATACCACGCG CCACTGGGTC GTCTTCATTC AGCCGCCGCA CTQTGCGCTT ACCTCCTTTG 
23641 CCATGCTTGA TTAGCACCX3G TGGGTTGCTG AAACCCACCA TTTGTAGCGC CACATCTTCT 
23701 CTTTCTTCCT CGCTGTCCAC GATTACCTCT GGTGATGGCG GGCGCTCGGG CTTGGGAGAA 
23761 GGGCGCTTCT ' ITrrLTX ' CrX ' GGGCGCAATG GCCAAATCCG CCGCCGAGGT CX3ATGGCCGC 
23821 GGGCTGGGTG TGCGCOGCAC CAGCGCGTCT TGTGATGAGT CTTCCTCGTC CTCGGACTCG 
23881 ATACQCCGCC TCATCCGCTT TTTTGGGGGC GCCC3GGGGAG OCGGCGGCGA CXK3GGACGGG 
23941 GACGACACGT CCTCCATGCT TGGGGGACGT CGCGCCGCAC CGCGTCCGCG CTCGGGGGTG 
24001 GTTTCGCGCT GCTCCTCTTC CCX3ACTGGCC ArTTCCTTCT CCTATAGGCA GAAAAAGATC 
24061 ATGGAQTCAG TCGAGAAGAA GGACAGCCTA ACCGCCCCCT CTGAGTTCGC CACCACCGCC 
24X21 TCCACCGATG CCGCCAACGC GCCTACCACC TTCCCCGTCG AGGCACCCCC GCTTGAGGAG 
24181 GAGGAAGTGA TTATCGAGCA GGACCCAGGT TTTGTAAGCG AAGACGACGA GGACCGCTCA 
24241 GTACCAACAG AGGATAAAAA GCAAGACCAG GACAACGCAG AGGCAAACGA GGAACAAGTC 
24301 GGGCGGGGGG ACGAAAGGCA TGGCGACTAC CTAGATGTGG GAOACGACXST GCTGTTGAAG 
24361 CATCTGCAGC GCCAGTGCX3C CATTATCTGC GACGCGTTGC AAGiAGCXXAG CGATGTGCCC 
24421 CTCGCCATAG CGGATGTCAG CCTTGCCTAC GAACGCCACC TATTCTCACC GCGCX3TACCC 
24481 CCCAAACGCC AAGAAAACGG CACATGCGAG CCCAACCCGC GCCTCAACTT CTACCCCGTA 
24541 TTTGCCGTGC CAGAGGTGCT TGCCACCTAT CACATCTTTT TCCAAAACTG CAAGATACCC 
24601 CTATCCTGCC GTGCCAACCG CAGCCGAGCG GACAAGCAGC TGGCCTTGCG GCAGGGCX3CT 
24661 GTCATACCTG ATATCGCCTC GCTCAACGAA GTGCCAAAAA TCTTTGAGGG TCTTGGACGC 
24721 GACGAGAAGC GCX3CX3GCAAA CGCTCTGCAA CAGGAAAACA GCGAAAATGA AAGTCACTCT 
24781 GGAGTOTTGG TGGAACTCGA GGGTGACAAC GCGCGCCTAG CCGTACTAAA ACGCAGCATC 
24841 GAGGTCACCC ACTTTGCCTA CCCGGCACTT AACCTACCCC CCAAGGTCAT GAGC ACAGTC 
24 901 ATGAGTGAGC TGATCGTGCG CCGTGCGCAG CCCCTGGAGA GGGATGCAAA TTTGCAAGAA 
24961 CT^CAGAGG AGGGCCTACC CX3CAGTTGGC GACGAGCAGC TAGCX3CGCTG GCTTCAAACG 
25021 CGCGAGCCTG CCGACTTGGA GGAGCGACGC AAACTAATGA TGGCCGCAGT GCTCGTTACC 
25081 GTGGAGCTTG AGTGCATGCA GCGGTTCTTT GCTGACCCXX3 AGATGCAGCG CAAGCTAGAG 
25141 GAAACATTGC ACTACACCTT TCGACAGGGC TACGTACX3CC AGGCCTGCAA GATCTCCAAC 
25201 GTGGAGCTCT GCAACCTGGT CTCCTACCTT GGAATTTTGC ACGAAAACCG CCTTGGGCAA 
25261 AACGTGCTTC ATTCCACGCT CAAGGGCGAG GCGC0CCGCX3 ACTACGTCCG CGACTGCGTT 
25321 TACTTATTTC TATGCTACAC CTGGCAGACG GCCATGGGCG TTTGGCAGCA GTOCTTGGAG 
25381 GAGTGCAACC TCAAGGAGCT GCAGAAACTG CTAAAGCAAA ACTTGAAGGA CCTATGGAOG 
25441 GCCTTCAACG AGCGCTCCGT GGCCGCGCAC CTGGCXX3ACA TCATTTTCCC CGAACG CCTG 
25501 CTTAAAACCC TGCAACAGGG TCTGCCAGAC TTCACCAGTC AAAGCATGTT GCAGAACTTT 
25561 AGGAACTTTA TCCTAGAGCG CTCAGGAATC TTGCCCGCCA CCTGCTGTGC ACTTCCTAGC 
2S621 GACTTTGTGC CCATTAAGTA CCGCGAATGC CCTCCGCCX3C TTTGGGGCCA CTGCTACCTT 
25681 CTGCAGCTAG CCAACTACCT TGCCTACCAC TCTGACATAA TGOAAGACGT GAGC GQTGA C 
25741 GGTCTACTGG AGTGTCACTG TCGCTGCAAC CTATGCACCC CGCACCGCTC CCTGGTTTGC 
25801 AATTCGCAGC TGCTTAACGA AAOTCAAATT ATCGQTACCT TTGAGCTGCA GGGTCCCTCG 
25861 CCTQACGAAA AGTCCGCGGC TCCGGGGTTG AAACTCACTC CGGGGCTGTG OACGTCGGCT 
25921 TACCTTCGCA AATTTGTACC TGAGGACTAC CACGCCCACG AGATTAGGTT CTACGAAGAC 
25981 CAATCCCGCC CGCCAAATGC GGAGCTTACC GCCTGCGTCA TTACCCAGGG CCACATTCTT 
26041 GGCCAATTGC AAGCCATCAA CAAAGCCCX3C CAAGAGTTTC TGCTACGAAA GGGACGGGGG 
26101 GTTTACrrGG ACCCCCAGTC CGGCOAGGAG CTCAACCCAA TCCCCCCGCC GCCGCAGCCC 
26161 TATCAGCAGC AGCOGCGGGC CCTTGCTTCC CAGGATGGCA CCCAAAAAGA AG CTGCAG CT 
26221 GCOGCCGCCA CCCAOGGACG AGGAGOAATA CTGGQACAGT CAGGCAGAGG AGGTTITGGA 
26281 COAGQAGGAG QAOOACATGA TGGAAGACTG OOAGAGCCTA OACGAGQAAG CTTCCGAGGT 
26341 CGAAGAGGTG TCAGACGAAA CACCGTCACC CTOGGTCGCA TTCCCCTCGC CGGCX5CCCCA 
26401 GAAATCGGCA ACCGGTTCCA GCATGGCTAC AACCTCCGCT CCTCAGGCGC CGCCGGCACT 
26461 GCCCXnrCGC CX3ACCCAACC GTAGATGGGA CACCACTGGA ACCAGGGCCG GTAAGTCCAA 
26521 GCAGCCGCCG CCGTTAGCCC AAQAGCAACA ACAGCX3CCAA GGCTACCGCT CATOG0QCX3G 
26581 GCACAAGAAC GCCATAOTTG CTTOCTTGCA AGACTGTGGG GGCAACATCT CXTTTCGCCCG 
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26641 CCGCTTTCTT CTCTACCATC ACGGCGTGGC CTTCCCCCGT AACATCCTGC ATTACTACCG 
26701 TCATCTCTAC AGCCCATACT OCACCGGCGG CAGCGGCAGC GGCAGCAACA GCAGCGGCCA 
26761 CACAGAAGCA AAGGC6ACCG GATAGCAAGA CTCTGACAAA GCCCAAGAAA TCCACAGCGG 
26821 CGGCAGCAGC AGGAGGAGGA GCGCTGCGTC TGGCGCCCAA CX3AACCCGTA TCGACCCGCG 
26881 AGCTTAGAAA CAGGATTTTT CCCACTCTGT ATGCTATATT TCAACAQAGC AGGGGCCAAG 
26941 AACAAGAGCT GAAAATAAAA AACAGGTCTC TGCGATCCCT CACCCGCAGC TGCCTGTATC 
27001 ACAAAAGCGA AGATCAGCTT CGGCGCACGC TGGAAGACGC GGAGGCTCTC TTCAGTAAAT 
27061 ACT6CX5CGCT GACTCTTAAG GACTAGTTTC GCGCCCTTTC TCAAATTTAA GCGCGAAAAC 
27121 TACX3TCATCT GCAGCGGCCA CACCCGGCGC CAGCACCTGT CGTCAGCGCC ATTATGAGCA 
27181 AGGAAATTCC CACGCCCTAC ATGTGGAGTT ACCAGCCACA AATGGQACTT GCGGCTGGAG 
27241 CTGCCCAAGA CTACTCAACC OGAATAAACT ACATGAGCGC GGGACCCCAC ATGATATCCC 
27301 GGGTCAACGG AATCCOCGCC CACCGAAACC GAATTCTCTT GQAACAGGCG GCTATTACCA 
27361 CCACACCTCG TAATAACCTT AATCCCCGTA GTTGGCCCGC TGCCCTGGTG TACCAGGAAA 
27421 GTCCCGCTCC CACCACTGTQ GTACTTCCCA GAGACGCCCA GGCCGAAGTT CAGATGACTA 
27481 ACTCAGGGGC GCAGCTTGCQ GGCGGCTTTC GTCACAGGGT GCGGTCGCCC GGGCAGGGTA 
27541 TAACTCACCT GACAATCAGA GGGCGAGGTA TTCAGCTCAA CGACGAGTCG 6TGA0CTCCT 
27601 CGCTTGGTCT CCGTCCGGAC GGGACATTTC AGATCGGCGG CGCCGGCCGT CCTTCATTCA 
27661 CGCCTCGTCA GGCAATCCTA ACTCTGCAGA CCTCGTCCTC TGAGCCGCGC TCTGGAGGCA 
27721 TTGGAACTCT GCAATTTATT GAGGAGTTTG TGCCATCGGT CTACTTTAAC CCCTTCTCGG 
27781 GACCTCCOGG CCACTATCCG GATCAATTTA TTCCTAACTT TGACX3CGGTA AAGGACTCGG 
27841 CGGACGGCTA CGACTGAATG TTAAGTGGAG AGGCAGAGCA ACTGCGCCTG AAACACCTGG 
27901 TCCACTCTCG CCGCCACAAG TGCTTTGCCC GCGACTCCGG TGAGTTTTGC TACTTTGAAT 
27961 TGCCCGAGGA TCATATCGAG GGCCCGGCGC ACGGCGTCCG GCTTACCGCC CAGGGAGAGC 
28021 TTGCCCGTAG CCTGATTCGG GAGTTTACCC AGCGCCCCCT GCTAGTTGAG CGGGACAGGG 
28081 GACCCTGTGT TCTCACTGTG ATTTGCAACT GTCCTAACCT TGGATTACAT CAAGATCTTT 
28141 GTTGCCATCT CTGTGCT6AG TATAATAAAT ACAGAAATTA AAATATACTG GGGCTCCTAT 
28201 CGCCATCCTG TAAACGCCAC CGTCTTCACC CGCCCAAGCA AACCAAGGCG AACCTTACCT 
28261 GGTACTTTTA ACATCTCTCC CTCTGTGATT TACAACAGTT TCAACCCAGA CGGAGTGAGT 
28321 CTACGAGAGA ACCTCTCCGA GCTCAGCTAC TCCATCAGAA AAAACACCAC CCTCCTTACC 
28381 TGCCGGGAAC GTACGAGTGC GTCACCGGCC GCTGCACCAC ACCTACCGCC TGACCGTAAA 
28441 CCAGACTTTT TCCGGACAGA CCTCAATAAC TCTGTTTACC AGAACAGGAG GTGAGCTTAG 
28501 AAAACCCTTA GGGTATTAGG CCAAAGGCGC AGCTACTGTG GGGTTTATGA ACAATTCAAG 
28561 CAACTCTACG GGCTATTCTA ATTCAGGTTT CTCTAGAATC GGGGTTGGGG TTATTCTCTG 
28621 TCTTGTGATT CTCTTTATTC TTATACTAAC GCTTCTCTGC CTAAGGCTC6 CCGCCTGCTG 
28681 TGTGCACATT TGCATTTATT GTCAGCTTTT TAAAOGCTGG GQTCGCCACC CAAGATGATT 
28741 AGGTACATAA TCCTAGGTTT ACTCACCCTT GCGTCAGCCC ACGGTACCAC CCAAAAGGTG 
28801 GATTTTAAGG AGCCAGCCTG TAATGTTACA TTCGCAGCTG AAGCTAATGA GTGCACCACT 
28861 CTTATAAAAT GCACCACAGA ACATGAAAAG CTGCTTATTC GCCACAAAAA CAAAATTGGC 
28921 AAGTATGCTG TTTATGCTAT TTGGCAGCCA GGTGACACTA CAGAGTATAA TGTTACAGTT 
28981 TTCCAGGGTA AAAGTCATAA AACTTTTATG TATACTTTTC CATTTTATGA AATGTGCGAC 
29041 ATTACCATGT ACATOAGCAA ACAGTATAAG TTGTGGCCCC CACAAAATTG TGTGGAAAAC 
29101 ACIGGCACTT TCTGCTGCAC TGCTATGCTA ATTACA6TGC TCGCTTTGGT CT6TACCCTA 
29161 CTCTATATTA AATACAAAAG CAGACGCAGC TTTATTGAGG AAAAOAAAAT GCCTTAATTT 
29221 ACTAAGTTAC AAAGCTAATG TCACCACTAA CTGCTTTACT C6CTGCTTGC AAAACAAATT 
29281 CAAAAAGTTA GCATTATAAT TAGAATAGGA TTTAAACCCC CCGGTCATTT CCTGCTCAAT 
29341 ACCATTCCCC TGAACAATTG ACTCTATGTG GGATATGCTC CAGCGCTACA ACCTTGAAGT 
29401 CAGGCTTCCT GGATGTCAGC ATCTGACTTT GGCCAGCACC TGTCCCGCGG ATTTGTTCCA 
29461 GTCCAACTAC AGCQACCCAC CCTAACAGAG ATGACCAACA CAACCAACGC GGCCGCCGCT 
29521 ACCGGACTTA CATCTACCAC AAATACACCC CAAGTTTCTG CCTTTGTCAA TAACTGGGAT 
29581 AACTTGGGCA T G TG G TGGTT CTCCATAGCO CTTATGTTTG TATQCCTTAT TATTATGTGG 
29641 CTCATCTGCT GCCTAAASCG CAAACGCOCC CGACCACCCA TCTATAGTCC CAT CATTGT G 
29701 CTACACCCAA ACAATGAT6G AATCCATAGA TTGGACGQAC TGAAACACAT GTTCTTTTCT 
29761 CTTACAGTAT GATTAAATGA GACATQATTC CTCOAGTTTr TATATTACTG ACCCTTGTTG 
29821 CGCTTTTTTG TGCGTGCTCC ACATTGGCTQ CGGTTTCTCA CATCGAAGTA GACTGCATTC 
29881 CAGCCTTCAC AGTCTATTTG CTTTACGGAT TTQTCACCCT CACGCTCATC TGCAGCCTCA 
29941 TCACTGTGGT CATCGCCTTT ATCCAGTGCA TTGACTGGGT CTGTGTGCGC TTT GCATA TC 
30001 TCAGACACCA TCCCCAGTAC AGGGACAGGA CTATAGCTGA GCTTCTTAGA ATTC7TTAAT 
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30061 TATGAAATTT ACTGTGACTT TTCTGCTGAT TATTTGCACC CTATCTGCGT TTTGTTCCCC 
30121 GACCTCCAAG CCTCAAAGAC ATATATCATG CAGATTCACT CGTATATGGA ATATTCCAAG 
30181 TTGCTACAAT GAAAAAAGCG ATCTTTCCGA AGCCTGGTTA TATGCAATCA TCTCTGTTAT 
30241 GGTGTTCTGC AGTACCATCT TAGCCCTAGC TATATATCCC TACCTTGACA TTGGCTGGAA 
30301 ACGAATAGAT GCCATGAACC ACCCAACTTT CCCCGCGCCC GCTATGCTTC CACTGCAACA 
30361 AGTTGTTGCC GGCGGCTTTG TCCCAGCCAA TCAGCCTCGC CCCACTTCTC CCACCCCCAC 
30421 TGAAATCAGC TACTTTAATC TAACAGGAGG AOATGACTGA CACCCTAGAT CTAGAAATGG 
30481 ACGGAATTAT TACAGAGCAG CGCCTGCTAG AAAGACGCAG GGCAGCGGCC GAGCAACAGC 
30541 GCATGAATCA AGAGCTCCAA GACATGGTTA ACTTGCACCA GTGCAAAAGG GGTATCTTTT 
30601 GTCTGGTAAA GCAGGCCAAA GTCACCTACG ACAGTAATAC CACCGGACAC CGCCTTAGCT 
30661 ACAAGTTGCC AACCAAGCGT CAGAAATTGG TGGTCATGGT GGGAGAAAAG CCCATTACCA 
30721 TAACTCAGCA CTCGGTAGAA ACCGAAGGCT GCATTCACTC ACCTTGTCAA GGACCTGAGG 
30781 ATCTCTGCAC CCTTATTAAG ACCCTGTGCG GTCTCAAAGA TCTTATTCCC TTTAACTAAT 
30841 AAAAAAAAAT AATAAAGCAT CACTTACTTA AAATCAGTTA GCAAATTTCT GTCCAGTTTA 
30901 TTCAGCAGCA CCTCCTTGCC CTCCTCCCAG CTCTGGTATT GCAGCTTCCT CCTGGCTGCA 
30961 AACTTTCTCC ACAATCTAAA TGGAATGTCA GTTTCCTCCT GTTCCTGTCC ATCCGCACCC 
31021 ACTATCTTCA TGTTGTTGCA GATGAAGCX3C GCAAGACCGT CTGAAGATAC CTTCAACCCC 
31081 GTGTATCCAT ATGACACGGA AACCGGTCCT CCAACTGTGC CTTTTCTTAC TCCTCCCTTT 
31141 GTATCCCCCA ATGGGTTTCA AGAGAGTCCC CCTGGGGTAC TCTCTTTGCG CCTATCCGAA 
31201 CCTCTAGTTA CCTCCAATGG CATGCTTGCG CTCAAAATGG GCAACGGCCT CTCTCTGGAC 
31261 QAGGCCGGCA ACCTTACCTC CCAAAATGTA ACCACTGTGA GCCCACCTCT CAAAAAAACC 
31321 AAGTCAAACA TAAACXTOGA AATATCTGCA CCCCTCACAG TTACCTCAGA AGCCCTAACT 
31381 GTGGCTGCCG CCGCACCTCT AATGGTCGCQ GQCAACACAC TCACCATGCA ATCACAGGCC 
31441 CCGCTAACCG TGCACGACTC CAAACTTAGC ATTGCCACCC AAGGACCCCT CACAGTGTCA 
31501 GAAGGAAAGC TAGCCCTGCA AACATCAGGC CCCCTCACCA CCACCGATAG CAGTACCCTT 
31561 ACTATCACTG CCTCACCCCC TCTAACTACT GCCACTGGTA GCTTGGGCAT TGACTTGAAA 
31621 GAGCCCATTT ATACACAAAA TGGAAAACTA GGACTAAAGT ACGGGGCTCC TTTGCATGTA 
31681 ACAGACGACC TAAACACTTT GACCGTAGCA ACTGGTCCAG GTGTGACTAT TAATAATACT 
31741 TCCTTGCAAA CTAAAGTTAC TGGAGCCTTG GGTTTTGATT CACAAGGCAA TATGCAACTT 
31801 AATGTAGCAG GAGGACTTUIG GATTGATTCT CAAAACAGAC GCCTTATACT T GATGTTA GT 
31861 TATCCGTTTG ATGCTCAAAA CCAACTAAAT CTAAGACTAG GACAGGGCCC TCTTTTTATA 
31921 AACTCAGCCC ACAACTTGGA TATTAACTAC AACAAAGGCC TTTACTTGTT TACAGCTTCA 
31981 AACAATTCCA AAAAGCTTGA GGTTAACCTA AGCACTGCCA AGGGGTTGAT GTTTGACGCT 
32041 ACAGCCATAG CCATTAATGC AGGAGATGGG CTTGAATTTG GTTCACCTAA TGCACCAAAC 
32101 ACAAATCCCC TCAAAACAAA AATTGGCCAT GGCCTAGAAT TTGATTCAAA CAAGGCTATG 
32161 GTTCCTAAAC TAGGAACTGG CCTTAGTTTT GACAGCACAG GTGCCATTAC AGTAGGAAAC 
32221 AAAAATAATG ATAAGCTAAC TTTGTGGACC ACACCAGCTC CATCTCCTAA CTGTAGACTA 
32281 AATGCAGAGA AAGATGCTAA ACTGACrTTG GTCTTAACAA AATGTGGCAG TCAAATACTT 
32341 GCTACAGTTT CAGTTTTGGC TGTTAAAGGC AGTTTGGCTC CAATATCTGG AACAGTTCAA 
32401 AGTGCTCATC TTATTATAAG ATTTGACGAA AATGGAGTGC TACTAAACAA TTCCTTCCTG 
32461 GACCCAGAAT ATTGGAACTT TAGAAATGGA GATCTTACTG AAGGCACAGC CTATACAAAC 
32521 GCTGTTGGAT TTATGCCTAA CCTATCAGCT TATCCAAAAT CTCACGGTAA AACTGCCAAA 
32581 AGTAACATTG TCAGTCAAGT TTACTTAAAC GGAGACAAAA CTAAACCTGT AACACTAACC 
32641 ATTACACTAA ACGGTACACA GGAAACAGGA GACACAACTC CAAGTGCATA CTCTATGTCA 
32701 TTTTCATGGG ACTGGTCTGG CCACAACTAC ATTAATGAAA TATTTGCCAC ATC CTCTTA C 
32761 ACTTTTTCAT ACATTGCCCA AGAATAAAOA ATCGTTTGTG TTAT6TTTCA ACGTGTTTAT 
32821 TTTTCAATTG CAGAAAATTT CAAGTCATTT TTCATTCAOT AGTATAGCCC CACCACCACA 
32881 TAGCTTATAC AGATCACCGT ACCTTAATCA AACTCACAGA ACCCTAGTAT TCAACCTGCC 
32941 ACCTCCCTCC CAACACACAG AGTACACAGT CCTTTCTCCC CGGCTGGCCT TAAAAAGCAT 
33001 CATATCATGG GTAACAGACA TATTCTTAGG TGTTATATTC CACACGGTTT CCTGTCGAGC 
33061 CAAACGCTCA TCAGTGATAT TAATAAACTC CCCGGGCAGC TCACTTAAGT TCATGTCGCT 
33121 GTCCAGCTGC TGAGCCACAG GCTGCTGTCC AACTTGCGGT TGCTTAACGG GCGGCGAAGG 
33181 AGAAOTCCAC GCCTACATGG GGGTAGAGTC ATAATCGTGC ATCAGGATAG GGCGGTGGTG 
33241 CTGCAGCAGC GCGCGAATAA ACTGCTGCCG CCGCCXSCTCC GTCCTGCAGG AATACAACAT 
33301 GGCAGTGGTC TCCTCAGCX3A TGATTCGCAC CGCCCX3CAGC ATAAGGCGCC TTGTCCTCCG 
33361 GGCACAGCAG CGCACCCTGA TCTCACTTAA ATCAGCACAG TAACTGCAGC ACAGCACCAC 
33421 AATATTGTTC AAAATCCCAC AQTOCAAGGC GCTGTATCCA AAGCTCATGG CX3GGGACCAC 
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334 Bl AGAACCCACG TGGCCATCAT ACCACAAGCG CAGGTAGATT AAGTGQCX5AC CCCTCATAAA 
33541 CACGCTGGAC ATAAACATTA CCTCTTTTGG CATGTTGTAA TTCACCACCT CCCGGTACCA 
33601 TATAAACCTC TGATTAAACA TGGCGCCATC CACCACCATC CTAAACCAGC TGGCXMAAC 
33661 CTGCCCGCCG GCTATACACT GCAGGGAACC GGGACTGGAA CAATGACAGT GQAGAGCCCA 
33721 GGACTCGTAA CCATGGATCA TCATGCTCGT CATGATATCA ATGTTGGCAC AACACAGGCA 
33781 CACGTGCATA CACTTCCTCA GGATTACAAG CTCCTCCCGC GTTAGAACCA TATCCCAGGG 
33841 AACAACCCAT TCCTGAATCA GCX5TAAATCC CACACTGCAG GGAAQACCTC GCACX3TAACT 
33901 CACGTIGTGC ATTGTCAAAG TGTTACATTC GGGCAGCAGC GGATGATCCT CCAGTATGGT 
33961 AGCGCGGGTT TCTGTCTCAA AAGGAGGTAG ACGATCCCTA CTGTACX3GAG TGCX3CCGAGA 
34021 CAACCGAGAT CGTGTTGGTC GTAGTGTCAT GCCAAATGGA ACGCCGGACG TAGTCATATT 
34081 TCCT6AAGCA AAACCAGGTG CGGGCGTGAC AAACAGATCT GCGTCTCCGG TCTCGCCGCT 
34141 TAGATCGCTC TGTGTAGTAG TTGTAGTATA TCCACTCTCT CAAAGCATCC AGGCGCCCCC 
34201 TGGCTTCGGG TTCTATGTAA ACTCCTTCAT GCGCCGCTGC CCTQATAACA TCCACCACCG 
34261 CAGAATAAGC CACACCCAGC CAACCTACAC ATTCGTTCTG CGAGTCACAC ACGGGAGGAG 
34321 CX3GGAAGAGC TGGAAGAACC ATGTTTTTTT TTTTATTCCA AAAGATTATC CAAAACCTCA 
34381 AAATGAAGAT CTATTAAGTG AACGCX3CTCC CCTCCGGTGG CGTGGTCAAA CTCTACAGCC 
34441 AAAGAACAGA TAATGGCATT TGTAAGATGT TGCACAATGG CTTCCAAAAG GCAAACGGCC 
34501 CTCACGTCCA AGTGGACGTA AAGGCTAAAC CCTTCAGGGT GAATCTCCTC TATAAACATT 
34561 CCAGCACCTT CAACCATGCC CAAATAATTC TCATCTCGCC ACCTTCTCAA TATATCTCTA 
34621 AGCAAATCCC GAATATTAAG TCCGGCCATT GTAAAAATCT GCTCCAGAGC GCCCTCCACC 
34681 TTCA6CCTCA AGCAGCGAAT CATGATTGCA AAAATTCAGG TTCCTCACAG ACCTGTATAA 
34741 GATTCAAAAG CGGAACATTA ACAAAAATAC CX5CX3ATCCCG TAGGTCCCTT CGCAGGGCCA 
34801 GCTGAACATA ATCGTGCAGG TCTGCACGGA CCAGCGCGGC CACTTCCCCG CCAGGAACCT 
34861 TGACAAAAGA ACCCACACTG ATTATGACAC GCATACTCGG AGCTATGCTA ACXIAGCGTAG 
34921 CCCCGATGTA AGCTTTGTTG CATGGGCG6C GATATAAAAT GCAAGGTGCT GCTCAAAAAA 
34981 TCAGGCAAAG CCTCGCGCAA AAAAGAAAGC ACATCGTAGT CATGCTCATG CAGATAAAGG 
35041 CAGGTAAGCT CCGGAACCAC CACAGAAAAA GACACCATTT TTCTCTCAAA CATGTCTGCG 
35101 GGTTTCTGCA TAAACACAAA ATAAAATAAC* AAAAAAACAT TTAAACATTA GAAGCCTGTC 
35161 TTACAACAGG AAAAACAACC CTTATAAGCA TAAGACX5GAC TACGGCCATG CCGGCGTGAC 
35221 CGTAAAAAAA CTGGTCACCG TGATTAAAAA GCACCACCGA CAGCTCCTCG GTCATGTCCG 
35281 GAGTCATAAT GTAAGACTCG GTAAACACAT CAGGTTGATT CATCGGTCAG TGCTAAAAAG 
35341 CGACCGAAAT AGCCCX3GGGG AATACATACC CGCAGGOGTA GAGACAACAT TACAGCCCCC 
35401 ATAGGAGGTA TAACAAAATT AATAGGAGAG AAAAACACAT AAACACCTGA AAAACCCTCC 
35461 TGCCTAGGCA AAATAGCACC CTCCCGCTCC AGAACAACAT ACAGCGCTTC ACAGCGGCAG 
35521 CCTAACAGTC AGCCTTACCA GTAAAAAAGA AAACCTATTA AAAAAACACC ACTCGACACG 
35581 GCACCAGCTC AATCAGTCAC AGTGTAAAAA AGGGCCAAGT GCAGAGCGAG TATATATAGG 
35641 ACTAAAAAAT GACGTAACGG TTAAAGTCCA CAAAAAACAC CCAGAAAACC GCACGCGAAC 
35701 CTACGCCCAG AAACQAAAGC CAAAAAACCC ACAACTTCCT CAAATCGTCA CTTCCGTTTT 
35761 CCCACGTTAC GTAACTTCCC ATTTTAAGAA AACTACAATT CCCAACACAT ACAAGTTACT 
35821 CCXSCXrCTAAA ACCTACX3TCA CCCX3CCCCGT TCCCACGCCC CX3CX3CCACGT CACAAACTCC 
35881 ACCCCCTCAT TATCATATTG GCTTCAATCC AAAATAAGGT ATATTATTGA TGATG 
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LOCUS KDl 33592 bp DNA SYN 28-APR-1999 

DEFINITION KDl 
ACCESSION KDl 
KEYWORDS 

SOURCE Unknown . 

ORGANISM Unknown 

UnclaBsif led. 
REFERENCE 1 (bases 1 to 33592) 
AUTHORS Self 
JOURNAL Unpub 1 i shed . 
FEATURES Location/Qualifiers 
CDS 1.. 33592 

/gene* "KDl" 
/product="KDl" 
BASE COUNT 7744 a 9470 c 9285 g 7093 t 

ORIGIN 

1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG GGGGTGGAGT 
61 TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG TAGTAGTGTG GCGG AAGTGT 
121 GATGTTGCAA GTGTGGCGGA ACACATGTAA GCGACGGATG TGGCAAAAGT GACGTTTTTG 
181 GTGTGCGCCG GTGTACACAG GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG 
241 TAAATTTGGG CGTAACCGAG TAA6ATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 
301 AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA GGGCCGCGGG 
361 GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT CTCAGGTGTT TTCCGCGTTC 
421 CGGGTCAAAG TTGGCGTTTT ATTATTATAG TCAGCTGACG TG TAGTG TAT TTATACCCGG 
481 TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCCGC 
541 TCCGACACCG GGACTGAAAA TGAGACATGA GGTACTGGCT GATAATCTTC CACCTCCTAG 
601 CCATTTTGAA CCACCTACCC TTCACGAACT GTATGATTTA GACGTGACGG CCCCCGAAGA 
661 TCCCAACGAG GAGGCGGTTT CGCAGATTTT TCCCGACTCT GTAATGTTGG CGGTG CAGGA 
721 AGGGATTGAC TTACTCACTT TTCCGCCGGC GCCCGGTTCT CCGGAGCCGC CTCACCTTTC 
781 CCGGCAGCCC GAGCAGCCGG AGCAGAGAGC CTTGGGTCCG GTTTGCCACG AGGCTGGCTT 
841 TCCACCCAGT GACGACGAGG ATGAAGAGGG TGAGGAGTTT GTGTTAGATT ATGTGGAGCA 
901 CCCCGGGCAC GGTTGCAGGT CTTGTCATTA TCACCGGAGG AATACGGGGG ACCCAGATAT 
961 TATGTGTTCG CTTTGCTATA TGAGGACCTG TGGCATGTTT GTCTACAGTA AG TGAAAA TT 
1021 ATGGGCAGTG GGTGATAGAG TGGTGGGTTT GGTGTGGTAA TTTTTTTTTT AATTTTTACA 
1081 GTTTTGTGGT TTAAAGAATT TTGTATTGTG ATTTTTTTAA AAGGTCCTGT GTCTGAACCT 
1141 GAGCCTGAGC CCGAGCCAGA ACCGGAGCCT GCAAGACCTA CCCGCCGTCC TAAAATGGCG 
1201 CCTGCTATCC TGAGACGCCC GACATCACCT GTGTCTAGAG AATGCAATAG TAGTACGGAT 
1261 AGCTGTGACT CCGGTCCTTC TAACACACCT CCTGAGATAC ACCCGGTGGT CCCGCTGTGC 
1321 CCCATTAAAC CAGTTGCCGT GAGAGTTGGT GGGCGTCGCC AGGCTGTGGA ATGTATCGAG 
1381 GACTTGCTTA ACGAGCCTGG GCAACCTTTG GACTTGAGCT GTAAACGCCC CAGGCCATAA 
1441 GGTGTAAACC TGTGATTCCG TGTGTGGTTA ACGCCTTTGT TTGCTGAATG AGTTGATGTA 
1501 AGTTTAATAA AGGGTGAGAT AATGTTTAAC TTGCATGGCG TGTTAAATGG GGCGGGGCTT 
1561 AAAGGGTATA TAATGCGCCG TGGGCTAATC TTGGTTACAT CTGACCTCAT GGAGGCTTGG 
1621 GAGTGTTTGG AAGATnTTC TGCTGTGCGT AACTTGCTGG AACAGAGCTC TAACAGTACC 
1681 ' iVrmU ll lT GGAGGTTTCT GTGGGGCTCA TCCCAGGCAA AGTTAGTCTG CAGAATTAAG 
1741 GAGGATTACA AGTGGGAATT TGAAGAGCTT TTGAAATCCT GTGGTG AGCT GTT TGATTC T 
1801 TTGAATCTGG GTCACCAQGC GCTTTTCCAA GAGAAGGTCA TCAAGACTTT GGATTTTTCC 
1861 ACACCGGGGC GOGCTGCGGC TGCTOTTGCT TTTTTQMTT TTATAAAGGA TAAATGGAGC 
1921 GAAGAAACCC ATCTOAGCGG GGGGTACCTG CTGGATTTTC TGGCCATGCA TCTQTGGAGA 
1981 GCGGTTGTGA GACACAAGAA TCGCCTGCTA CTGTTGTCTT CCGTCCGCCC GGCGATAATA 
2041 CCGACGGAGG AGCAGCAGCA GCAGCAGGAG GAAGCCAGGC GGCGGCGGCA GGAGCAGAGC 
2101 CCATCGAACC CGAGAGCCGG CCTGGACCCT CGGGAATGAA TGTTGTACAG GTGGCTGAAC 
2161 TGTATCCAGA ACTGAGACGC ATTTTGACAA TTACAGAGQA TGGGCAGGGG C TAAAGG GGG 
2221 TAAAOAGGGA GCGGGGGGCT TGTGAGGCTA CAGAGGAGGC TAGGAATCTA GCTTTTAGCT 
2281 TAAT6ACCAG ACACCGTCCT GAGTGTATTA CriTTCAACA GATCAAGGAT AATTGCGCTA 
2341 ATGAGCTTOA TCTGCTOOOO CAGAAGTATT CCATAGAGCA GCTGACCACT TACTGGCTGC 
2401 AGCCAGGGOA TGATTTTGAO GAGGCTATTA GGOTATATGC AAAGGTGGCA CTTAGGCCAG 
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2461 ATTGCAAGTA CAAGATCAGC AAACTTGTAA ATATCAGGAA TTGTTGCTAC ATTTCTGGGA 
2521 ACGGGGCCX3A GGTGGAGATA GATACGGAGG ATAGGGTGGC CTTTAGATGT AGCATGATAA 
2581 ATATGTGGCC GGGGGTGCTT GGCATGGACG GGGTGGTTAT TATGAATGTA AGGTTTACTG 
2641 GCCCCAATTT TAGCGGTACG GTTTTCCTGG CCAATACCAA CCTTATCCTA CACGGTGTAA 
2701 GCTTCTATGG GTTTAACAAT ACCTGTGTGG AAGCCTGGAC CGATGTAAGG GTTCGGGGCT 
2761 GTGCCTTTTA CTGCTGCTGG AAGGGGGTGG TGTGTCGCCC CAAAAGCAGG GCTTCAATTA 
2821 AGAAATGCCT CTTTGAAAGG TGTACCTTGG GTATCCTGTC TGAGGGTAAC TCCAGGGTGC 

2 881 GCCACAATGT GGCCTCCGAC TGTGGTTGCT TCATGCTAGT GAAAAGCGTG GCTGTGATTA 
2941 AGCATAACAT GGTATGTGGC AACTGCGAGG ACAGGGCCTC TCAGATGCTG ACCTGCTCGG 
3001 ACGGCAACTG TCACCTGCTG AAGACCATTC ACGTAGCCAG CCACTCTCGC AAGGCCTGGC 
3061 CAGTGTTTGA GCATAACATA CTGACCCGCT GTTCCTTGCA TTTGGGTAAC AGGAGGGGGG 
3121 TGTTCCTACC TTACCAATGC AATTTGAGTC ACACTAAGAT ATTGCTTGAG CCCGAGAGCA 
3181 TGTCCAAGGT OAACCTGAAC GGGGTGTTTG ACATOACCAT GAAGATCTGG AAGGTGCTGA 
3241 GGTACGATGA GACCCGCACC AGGTGCAGAC CCTGCGAGTG TGGCGGTAAA CATATTAGGA 

3 301 ACCAGCCTGT GATGCTGGAT GTGACCGAGG AGCTGAGGCC CX5ATCACTTG GTGCTGGCCT 
3361 GCACCCGCGC TGAGTTTGGC TCTAGCGATG AAGATACAGA TTGAGGTACT GAAATGTGTG 
3421 GGCGTGGCTT AAGGGTGGGA AAGAATATAT AAGGTGGGGG TCTTATGTAG TTTTGTATCT 
3481 GTTTTGCAGC AGCCGCCGCC GCCATGAGCA CCAACTCGTT TGATGGAAGC ATTGTGAGCT 
3541 CATATTTGAC AACGCGCATG CCCCCATGGG CCGGGGTGCG TCAGAATGTG ATGGGCTCCA 
3601 GCATTGATGG TCGCCCCGTC CTGCCCGCAA ACTCTACTAC CTTGACCTAC GAGACCXSTGT 
3661 CTGGAACGCC GTTGGAGACT GCAGCCTCCG CCGCCGCTTC AGCCX3CTGCA GCCACCGCCC 
3721 GCGGGATTGT GACTGACTTT GCTTTCCTGA GCCCGCTTGC AAGCAGTGCA GCTTCCCGTT 
3781 CATCCGCCCG CGATGACAAG TTGACGGCTC TTTTGGCACA ATTGGATTCT TTGACCCGGG 
3841 AACTTAATGT CGTTTCTCAG CAGCTGTTGG ATCTGCGCCA GCAGGTTTCT GCCCTGAAGG 
3901 CTTCCTCCCC TCCCAATGCG GTTTAAAACA TAAATAAAAA ACCAGACTCT GTTTGGATTT 
3961 GGATCAAGCA AGTGTCTTGC TGTCTTTATT TAGGGGTTTT GCGCGCGCGG TAGGCCCGGG 
4021 ACCAGCGGTC TCGGTCGTTG AGGGTCCTGT GTATTTTTTC CAGGACGTGG TAAAGGTGAC 
4081 TCTGGATGTT CAGATACATG GGCATAAGCC CGTCTCTGGG GTGGAGGTAG CACCACTGCA 
4141 GAGCTTCATG CTGCGGGGTG GTGTTGTAGA TGATCCAGTC GTAGCAGGAG CX3CTGGGCGT 
4201 GGTGCCTAAA AATGTCTTTC AGTAGCAAGC TGATTGCCAG GGGCAGGCCC TTGGTGTAAG 
4261 TGTTTACAAA GCGGTTAAGC TGGGATGGGT GCATACGTGG GGATATGAGA TGCATCTTGG 
4321 ACTGTATTTT TAGGTTGGCT ATGTTCCCAG CCATATCCCT CCGGGGATTC ATGTTGTGCA 
4381 GAACCACCAG CACAGTGTAT CCGGTGCACT TGGGAAATTT GTCATGTAGC TTAGAAGGAA 
4441 ATGCGTGGAA GAACTTGGAG ACGCCCTTGT GACCTCCAAG ATTTTCCATG CATTCX3TCCA 
4501 TAATGATGGC AATGGGCCCA CGGGCGGCX5G CCTGGGCGAA GATATTTCTG GGATCACTAA 
4561 CGTCATAGTT GTGTTCCAGG ATGAGATCGT CATAGGCCAT TTTTACAAAG CGCGGGCGGA 
4621 GGGTGCCAGA CTGCGGTATA ATGGTTCCAT CCGGCCCAGG GGCGTAGTXA CCCTCACAGA 
4681 TTTGCATTTC CCACGCTTTG AGTTCAGATG GGGGGATCAT GTCTACCTGC GGGGCGATGA 
4741 AGAAAACGGT TTCCGGGGTA GGGGAGATCA GCTGGGAAGA AAGCAGGTTC CTGAGCAGCT 
4801 GCGACTTACC GCAGCCGGTG GGCCCGTAAA TCACACCTAT TACCGGGTGC AACTGGTAGT 
4861 TAAQAGAGCT GCAGCTGCXrG TCATCCCTGA GCAGGGGGGC CACTTCX5TTA AGCATGTCCC 
4921 T6ACTCGCAT GTTTTCCCTG ACCAAATCCG CCAGAAGGCG CTCGCCGCCC AGCGAT AGCA 
4981 GTTCTTGCAA GGAAGCAAAG TTTTTCAACG GTTTGAGACC GTCCGCCGTA GGCATGCTTT 
5041 TGAGCX5TTTG ACCAAGCAGT TCCAGGCGGT CCCACAGCTC GGTCACCTGC TCTACGGCAT 
5101 CTCGATCCAG CATATCTCCT CGTTTCGCGG QTTGGGGCQG CTTTCGCTGT ACGGCAGTAG 
5161 TCGGTGCTCG TCCAGACGGG CCAGGGTCAT GTCTTTCCAC GGGCGCAGGG TCCTCGTCAG 
5221 CX3TAGTCTGG GTCACGGTGA AGGGGTGCGC TCCXK5GCTGC GCGCTGGCCA GGGTGCGCTT 
5281 GAGGCTGGTC CTGCTGGTGC TGAAGCGCTG CCX5GTCTTCG CCCTGCGCGT CGGCCAGGTA 
5341 GCATTTGACC ATGGTGTCAT AGTCCAGCCC CTCCGCX3GCG TGGCCCTTGG CGCGCAGCTT 
5401 GCCCTTGGAG GAGGCGCCGC ACOAGGGGCA GTGCAGACTT TTGAGGGCGT AGAGCTTGGG 
5461 CGCGAGAAAT ACCGATTCCG GGGAGTAGGC ATCCGCGCCG CAGG6CCCGC AGACGGTCTC 
5521 GCATTCCACG AGCCAGGTGA GCTCTGGCCG TTCGGGGTCA AAAACCAGGT TTCCCCCATG 
5581 CTTTTTGATG CGTTTCTTAC CTCTGGTTTC CATGAGCCX3G TGTCCACGCT CGGTQACGAA 
5641 AAGGCTGTCC GTOTCCCCXTT ATACAGACTT GAGAGGCCTG TCCTCGAGCG GTGTTCCGCG 
5701 GTCCTCCTCG TATAGAAACT CGGACCACTC TGAGACAAAG GCTCGCGTCC AGGCCAGCAC 
5761 GAAGGAGGCT AAGTGGGAGG GGTAGCGGTC GTTGTCCACT AGGGGGTCCA CTCGCTCCAG 
5821 GGTGTGAAGA CACATGTCX5C CCTCTTCXK5C ATCAAGGAAG GTOATTGGTT TOTAGGTGTA 
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5881 GGCCACGTGA CCGGGTGTTC CTGAAGGGGG GCTATAAAAG GGGGTGGGGG CGCX3TTCGTC 
5941 CTCACTCTCT TCCGCATCGC TGTCTGCGAG GGCCAGCTGT TGGGGTOAQT ACTCCCTCTG 
6001 AAAAGCGGGC ATGACTTCTG CGCTAAGATT GTCAGTTTCC AAAAACGAGG AGGATTTOAT 
6061 ATTCACCTGG CCCGCGGTGA TGCCTTTGAG GGTGGCCGCA TCCATCTGGT CAGAAAAGAC 
6121 AATCTTTTTG TTGTCAAGCT TGGTGGCAAA CGACCCGTAG AGGGCGTTGG ACAGCAACTT 
6181 GGCGATGGAG CGCAGGGTTT GGTTTTTGTC GCGATCGGCG CGCTCCTTGG CCGCGATGTT 
6241 TAGCTGCACG TATTCGCGCG CAACGCACCG CCATTCGGGA AAGACGGTGG TGCGCTCGTC 
6301 GGGCACCAGG TGCACGCGCC AACCGCGGTT GTGCAGGGTG ACAAGGTCAA CGCTGGTGGC 
6361 TACCTCTCCG CGTAGGCGCT CGTTGGTCCA GCAGAGGCGG CC6CCCTTGC GCGAGCAGAA 
6421 TGGCGGTAGG GGGTCTAGCT GCGTCTCGTC CGQGGGGTCT GCQTCCACGG TAAAGACCCC 
6481 GGGCAGCAGG CGCGCGTCGA AGTAGTCTAT CTTGCATCCT TOCAAGTCTA GCGCCTGCTG 
6541 CCATGCGCGG GCGGCAAGCG CGCGCTCGTA TGGGTTGAGT GGGGGACCCC ATGGCATGGG 
6601 GTGGGTGAGC GCGGAGGCGT ACATGCCGCA AATGTCGTAA ACGTAGAGGG GCTCTCTGAG 
6661 TATTCCAAGA TATGTAGGGT AGCATCTTCC ACCGCGGATG CTGGCGCGCA CGTAATCGTA 
6721 TAGTTCGTGC GAGGGAGCGA GGAGGTCGGG ACCGAGGTTG CTACGGGCGG GCTGCTCTGC 
6781 TCGGAAGACT ATCTGCCTGA AGATGGCATG TGAGTTGGAT GATATGGTTG GACGCTGGAA 
6841 GACGTTCAAG CTGGCGTCTG TGAGACCTAC CGCGTCACGC ACGAAGGAGG COTAGQAGTC 
6901 GCGCAGCTTG TTGACCAGCT CGGCGGTGAC CTGCACGTCT AGGGCX3CAGT AGTCCAGGGT 
6961 TTCCTTGATG ATGTCATACT TATCCTGTCC CTTTTTTTTC CACAGCTCGC GGTTGAGGAC 
7021 AAACTCTTCG CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 
7081 AGAGCCTAGC ATGTAGAACT GGTTGACGGC CTGGTAGGCG CAGCATCCCT TTTCTACGGG 
7141 TAGCGCGTAT GCCTGCGCGG CCTTCCGGAG CGAGGTGTGG GTGAGCGCAA AGGTGTCCCT 
7201 GACCATGACT TTGAGGTACT GGTATTTGAA GTCAGTGTCG TCGCATCCGC CCTGCTCCCA 
7261 GAGCAAAAAG TCCGTGCGCT TTTTGGAACX5 CGGATTTGGC AGGGCGAAGG TGACATCGTT 
7321 GAAGAGTATC TTTCCCGCGC GAGGCATAAA GTTGCGTGTG ATGCGGAAGG GTCCCGGCAC 
7381 CTCX3GAACGG TTGTTAATTA CCTGGGCGGC GAGCACGATC TCGTCAAAGC CGTTGATGTT 
7441 GTGGCCCACA ATGTAAAGTT CCAAGAAGCG CGGGATGCCC TTGATGGAAG GCAATTTTTT 
7501 AAGTTCCTCG TAGGTGAGCT CTTCAGGGGA GCTGAGCCCG TGCTCTGAAA GGGCCCAGTC 
7561 TGCAAGATGA GGGTTGGAAG CGACGAATGA GCTCCACAGG TCACGGGCCA TTAGCATTTG 
7621 CAGGTGGTCG CGAAAGGTCC TAAACTGGCG ACCTATGGCC ATTTTTTCTG GGGTGATGCA 
7681 GTAGAAGGTA AGCX3GGTCTT GTTCCCAGCG GTCCCATCCA AGGTTCGCGG CTAGGTCTCG 
7741 CGCGGCAGTC ACTAGAGGCT CATCTCCGCC GAACTTCATG ACCAGCATGA AGGGCACGAG 
7801 CTGCTTCCCA AAGGCCCCCA TCCAAGTATA GGTCTCTACA TCGTAGGTGA CAAA6AGACG 
7861 CTCGGTGCGA GGATGCGAGC CGATCGGGAA GAACTGGATC TCCCX3CCACC AATTGGAGGA 
7921 GTGGCTATTG ATGTGGTGAA AGTAGAAGTC CCTGCGACGG GCCGAACACT CGTGCTGGCT 
7981 TTTGTAAAAA CGTGCGCAGT ACTGGCAGCG GTGCACGGGC TGTACATCCT GCACGAGGTT 
8041 GACCTGACGA CCGCGCACAA GGAAGCAGAG TGGGAATTTG AGCCCCTCGC CTGGCGGGTT 
8101 TGGCTGGTGG TCTTCTACTT CGGCTGCTTG TCCTTGACCG TCTGGCTGCT CGAGGGGAGT 
8161 TACGGTGGAT CGGACCACCA CGCCGCGCGA GCCCAAAGTC CAGATGTCCG CGCGCGGCGG 
8221 TCGOAGCTTG ATGACAACAT CGCGCAGATG GGAGCTGTCC ATGGTCTGGA GCTCCCGCGG 
8281 CGTCAGGTCA GGCGGGAGCT CCTGCAGGTT TACCTCGCAT AGACGGGTCA GGGCGCGGGC 
8341 TAGATCCAGG TGATACCTAA TTTCCAGGGG CTGGTTGGTG GCGOCGTCGA TGGCTTGCAA 
8401 GAGGCCGCAT CCCCGCGGCG CGACTACGGT ACCX5CGCGGC GGGCGGTGGG CCGOOGGGGT 
8461 GTCCTTGGAT GATGCATCTA AAAGCGGTGA CGCGGGCGAG CCCCCGGAGG TAGGGGGGGC 
8521 TCCGGACCCG CCGGGAGAGG GGGCAGGGGC ACGTCGGCGC CGCGCGCGGG CAGQAGCTGG 
8581 TGCTGCGCGC GTAGGTTGCT GGCGAACGCG ACGACGCGGC GGTTGATCTC CTGAATCTGG 
8641 CGCCTCTGCG TGAAGACGAC GGGCCCGGTG AGCTTGAGCC TGAAAGAGAG TTCGACAOAA 
8701 TCAATTTCGG TGTCGTTGAC GGCGGCCTGG CGCAAAATCT CCTGCACGTC TCCTOAGTTG 
8761 TCTTGATAGG CX3ATCTCGGC CATGAACTGC TCGATCTCTT CCTCCTGGAG ATCTCCGCGT 
8821 CCGGCTCGCT CCACGGTGGC GGCOAGGTCG TTGGAAATGC GGGCCATGAG CT60QAQAA0 
8881 GCX3TTGAGGC CTCCCTCGTT CCAGACGCGG CTOTAGACCA CGCCCCCTTC GGCATCGCGO 
8941 GCGCGCATGA CCACCTGCGC GAGATTGAGC TCCACGTGCC GGGCGAAGAC GGCGTAGTTT 
9001 CGCAGGCGCT GAAAGAGGTA GTTGAGGGTG GTGGCGGTGT GTTCTGCCAC GAAGAAQTAC 
9061 ATAACCCAGC GTCGCAACGT GGATTCGTTG ATATCCCCCA AGGCCTCAAG GCGCTCCATG 
9121 GCCTCGTAGA AGTCCACX3GC GAAGTTGAAA AACTGGGAGT TGCGCGCCGA CACGGTTAAC 
9181 TCCTCCTCCA GAAOACGGAT GAGCTCGGCG ACAGTGTCGC GCACCTCGCG CTCAAAGGCT 
9241 ACAGGGGCCT CTTCTTCTTC TTCAATCTCC TCTTCCATAA GGGCCTCCCC TTCTTCTTCT 
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9301 TCTGGCGGCG GTGGGGGAGG GGGGACACGG 
9361 ACAAAGCGCT CGATCATCTC CCCGCGGCGA 
9421 TTCTCGCGGG GGCGCAGTTG GAAGACGCCG 
9481 GGGCTGCCAT GCGGCAGGGA TACGGCGCTA 
9541 ACTCCGCCGC CGAGGGACCT GAGCGAGTCC 
9601 AAGGCGTCTA ACCAGTCACA GTCX3CAAGGT 
9661 CGGCGGTCGG GGTTQTTTCT GGCGGAGGTG 
9721 TTGAGACXK3C GGATGGTCGA CAGAAGCACC 
9781 AGGCGGTCGG CCATGCCCCA GGCTTCXnTT 
9841 TGCATGAGCC TTTCTACCGG CACTTCTTCT 
9901 TCTATCGCTG CGGCGGCGGC GGAGTTTGGC 
9961 GTGACCCCGA AGCCCCTCAT CX5GCTGAAGC 
10021 AATATGGCCT GCTGCACCTG CGTGAGGGTA 
10081 TGGTATGCGC CCGTGTTGAT GGTGTAAGTG 
10141 TGGTGACCCG GCTGCGAGAG CTCGGTGTAC 
10201 ACGTAGTCGT TGCAAGTCCG CACCAGGTAC 
10261 TGGCGGTAGA GGGGCCAGCG TAGGGTGGCC 
10321 AGGCGATGAT ATCCGTAGAT GTACCTGQAC 
10381 GCGCGCGGAA AGTCGCGGAC GCGGTTCCAG 
10441 GTCGGGACQC TCTGGCCX5GT CAGGCGCGCG 
10501 GAGCCTGTAA GCGGGCACTC TTCCGTGGTC 
10561 GGACGACCGG GGTTCGAGCC CCGTATCCGG 
10621 CGCGTGTCGA ACCCAGGTGT GCGACGTCAG 
10681 TCCAGGCGCG GCGGCTGCTG CGCTAGCTTT 
10741 GTTAGGCTGG AAAGCGAAAG CATTAAGTGG 
10801 CAAGGGTTGA GTCGCGGGAC CCCCGGTTCG 
10861 GGGGTTTGCC TCCCCGTCAT GCAAGACCCC 
10921 GCCCCTTTTT TGCTTTTCCC AGATGCATCC 
10981 GCAGCGGCAA GA6CAAGAGC AGCGGCAGAC 
11041 GTCAGGAGGG GCX3ACATCCG CGGTTGACX3C 
11101 GCGCCGGGCC CGGCACTACC TGGACTTGGA 
11161 GCCCTCTCCT GAGCGGTACC CAAGGGTGCA 
11221 GCCGCGGCAG AACCTGTTTC GCGACCGCGA 
11281 AAAGTTCCAC GCAGGGCGCG A6CTGCG6CA 
11341 GGAGGACTTT GAGCCCX3ACG CGC6AACCX3G 
11401 CGCCGACCTG GTAACCX5CAT ACGAGCAGAC 
11461 CTTTAACAAC CACGTGCGTA CGCTTGTGGC 
11521 TCTGTGGGAC TTTGTAAGCG CGCTGGAGCA 
11581 GCTGTTCCTT ATAGTGCAGC ACAGCAGGGA 
11641 CATAGTA6AG CCCGAGGGCC GCTGGCT6CT 
11701 GGT6CAGGAG CGCAGCTTGA GCCTGGCTGA 
11761 TAGCCTGGGC AAGTTTTAC6 CCCGCAAGAT 
11821 GGAGGTAAAG ATCGAGGGGT TCTACATGCG 
11881 CGACCTGGGC GTTTATCGCA ACGAGCX3CAT 
11941 CGAGCTCAGC GACCGCGAGC TGATGCACAG 
12001 CXK3CGATAGA GAGGCCGAGT CCTACTTTGA 
12061 CCGACGCGCC CTGGAGGCAG CTGGGGCCX3G 
12121 TGGCAACGTC GGCGGCGTGG AGGAATATGA 
12181 CGAGTACTAA GCGGTGATGT TTCTGATCAG 
12241 CGG6CGGCGC T6CAGAGCCA GCCGTCCGGC 
12301 ATGGACCGCA TCATGTCX3CT GACTGCGCGC 
12361 GCCAACCGGC TCTCCGCAAT TCTGGAAGCG 
12421 GAGAAGGTGC TGGCGATCGT AAACGCGCTG 
12481 GCCX3GCCTGG TCTACGACGC GCTGCTTCAQ 
12541 CAGACCAACC TGGACCGGCT GGTGGGGGAT 
12601 GCGCAGCAGC AGGGCAACCT GGGCTCCATG 
12661 CCC6CCAACXI TGCCX3CGGG0 ACAGGAGGAC 
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CGGCGACGAC GGCGCACCGG GAGGCXX5TCG 
CGGCGCATGG TCTCGGTGAC GGCGCGGCCG 
CCCGTCATGT CCCGGTTATG GGTTGGCGGG 
ACGATGCATC TCAACAATTG TTGTGTAGGT 
GCATCGACCG GATCGGAAAA CCTCTCOAGA 
AGGCTGAGCA CCGTGGCGGG CGGCAGCGGG 
CTGCTGATGA TGTAATTAAA GTAGGCG6TC 
ATGTCCTTGG OTCCGGCCTG CTGAATGCGC 
TGACATCGGC GCAGGTCTTT GTAGTAGTCT 
TCTCCTTCCT CTTGTCCTGC ATCTCTTGCA 
CGTAGGTGGC GCCCTCTTCC TCCCATGCGT 
AGGGCTAGGT CGGCGACAAC GCGCTGGGCT 
GACTGGAAGT CATCCATGTC CACAAAGCGG 
CAGTTGGCCA TAACGGACCA GTTAAOGGTC 
CTGAGACXSCG AGTAAGCCCT CGAGTCAAAT 
TGGTATCCCA CCAAAAAGTG CGGCGGCGGC 
GGGGCTCCGG GGGCGAGATC TTCCAACATA 
ATCCAGGTGA TGCCGGCGGC GGTGGTGGAG 
ATGTTGCGCA GCGGCAAAAA GTGCTCCATG 
CAATCGTTGA CGCTCTAGCG TGCAAAAGQA 
TGGTGGATAA ATT06CAAGG GTATCATGGC 
CCGTCCGCCG TGATCCATGC GGTTACCGCC 
ACAACGGGGG AGTGCTCCTT TTGGCTTCCT 
TTTGGCCACT GGCCGCGCGC AGCGTAAGCG 
CTCGCTCCCT GTAGCCGGAG GGTTATTTTC 
AGTCTCGGAC CGGCCGGACT GCGGCGAACG 
GCTTGCAAAT TCCTCCGGAA ACAQQGACGA 
GGTGCTGCGG CAGATGCGCC CCCCTCCTCA 
ATGCAGGGCA CCCTCCCCTC CTCCTACCGC 
GGCAGCAGAT GGTGATTAC6 AACCCCCGCG 
GGAGGGCGAG GGCCTGGCGC GGCTAGGAGC 
GCTGAAGCGT GATACGCGTG AGGCGTACGT 
GGGAGA6GAG CCCGAGGAGA TGCGGGATCG 
TGGCCTGAAT CGCGAGCGGT TGCTGCGCGA 
GATTAGTCCC GCGCGCGCAC ACGTGGCGGC 
GGTGAACCAG GAGATTAACT TTCAAAAAAG 
GCGCGAGGAG GTGGCTATAG GACTGATGCA 
AAACCCAAAT AGCAAGCCGC TCATGGCGCA 
CAACGAGGCA TTCAGGGATG CGCTGCTAAA 
CGATTTGATA AACATCCTGC AGAGCATAGT 
CAAGGTGGCC GCCATCAACT ATTCCATGCT 
ATACCATACC CCTTACGTTC CCATAGACAA 
CATGGCGCTG AAGGTGCTTA CCTTQAOCGA 
CCACAAGGCC GTGAOCGTGA GCGG6CGGCG 
CCTGCAAAGG GCCCTGGCTG 6CACGGGCA6 
CGCGGGCGCT GACCTGCGCT GGGCCCCAAG 
ACCTGGGCTG GCGGTGGCAC CCGCGCGCGC 
CGAGGACGAT GAGTACGAGC CAQAGGACGG 
ATGATGCAAG. ACGCAACGGA CCCGGCGQTG 
CTTAACTCCA CGGACQACTG GCGCCAGGTC 
AATCCTGACG CGTTCOGGCA GCA6CCGCAG 
GTGGTCCCGG CGCGCGCAAA CCCCACGCAC 
GCCGAAAACA GGGCCATCCG GCCCXJACGAG 
CGCGTGGCTC GTTACAACAG CGGCAACOTG 
GTGCGCGAGG CCGTGGCGCA GCQTGAGCGC 
GTTGCACTAA ACGCCTTCCT GAGTACACAG 
TACACCAACT TTGTOAGCGC ACTGOGGCTA 
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12721 ATGGTGACTG AGACACCGCA AAGTGAGGTG TACCAGTCTG GGCCAGACTA TTTTTTCCAG 
12781 ACCAGTAGAC AAGGCCTGCA GACCGTAAAC CTGAGCCAGG CTTTCAAAAA CTTGCAGGGG 
12841 CTGTGGGGGG TGCGGGCTCC CACAGGCGAC CGCGCGACCG TGTCTAGCTT GCTGACGCCC 
12901 AACTCGCGCC TGTTGCTGCT GCTAATAGCG CCCTTCACGG ACAGTGGCAG CGTGTCCCGG 
12961 GACACATACC TAGGTCACTT GCTGACACTG TACCGCGAGG CCATAGGTCA GGCGCATGTG 
13021 GACGAGCATA CTTTCCAGGA GATTACAAGT GTCAGCCGCG CGCTGGGGCA GGAGGACACG 
13081 GGCAGCCTGQ AGGCAACCCT AAACTACCTG CTGACCAACC GGCX3GCAGAA GATCCCCTCG 
13141 TTGCACAGTT TAAACAGC6A GGAGGAGCX3C ATTTTGCGCT ACGTGCAGCA GAGCGTGAGC 
13201 CTTAACCTGA TGCGCGACGG GGTAACGCCC AGCGTGGCGC TGGACATGAC CGCGCGCAAC 
13261 ATGGAACCGG GCATGTATGC CTCAAACCGG CCGTTTATCA ACCGCCTAAT GGACTACTTG 
13321 CATCGCGCGG CCGCCGTGAA CCCCGAGTAT TTCACCAATG CCATCTTGAA CCCGCACTGG 
13381 CTACCGCCCC CTGGTTTCTA CACCGGGGGA TTCX3AGGTGC CCGAGGGTAA CGATGGATTC 
13441 CTCTGGGACG ACATAGACGA CAGCGTGTTT TCCC<X3CAAC CGCAGACCCT GCTAGAGTTG 
13501 CAACAGCGCG AGCAGGCAGA GGCGGCGCTG CGAAAGGAAA GCTTCCGCAG GCCAAGCAGC 
13561 TTGTCCGATC TAGGCGCTGC GGCCCCGOGG TCAGATGCTA GTAGCCCATT TCCAAQCTTG 
13621 ATAGGGTCTC TTACCAGCAC TCGCACCACC CGCCCGCGCC TGCTGGGCGA GGAGGAGTAC 
13681 CTAAACAACT CGCTGCTGCA GCCX3CAGCGC GAAAAAAACC TGCCTCCGGC ATTTCCCAAC 
13741 AACGGGATAG AGAGCCTAGT GGACAAGATG AGTAGATGGA AGACGTACGC GCAGGAGCAC 
13801 AGGGACGTGC CAGGCCCGCG CCCGCCCACC CGTCGTCAAA GGCACX3ACCG TCAGCGGGGT 
13861 CTGGTGTGGG AGGACGATGA CTCGGCAGAC GACAGCaiGOG TCCTGGATTT GGGAGGGAGT 
13921 GGCAACCCGT TTGCGCACCT TCGCCCCAGG CTGGGGAGAA TGTTTTAAAA AAAAAAAAGC 
13981 ATGATGCAAA ATAAAAAACT CACCAAGGCC ATGGCACCGA GCGTTGGTTT TCTTGTATTC 
14041 CCCTTAGTAT GCXX5CGCGCG GCGATGTATG AGGAAGGTCC TCCTCCCTCC TACGAGAGTG 
14101 TGGTGAGCGC GGCGCCAGTG GCGGCGGCGC TGGGTTCTCC CTTCGATGCT CCCCTGGACC 
14161 CGCCGTTTGT GCCTCCGCGG TACCTGCGGC CTACCX5GGGG GAGAAACAGC ATCCGTTACT 
14221 CTGAGTTGGC ACCCCTATTC GACACCACCC GTGTGTACCT GGTGGACAAC AAGTCAACGG 
14281 ATGTGGCATC CCTGAACTAC CAGAACGACC ACAGCAACTT TCTGACCACG GTCATTCAAA 
14341 ACAATGACTA CAGCCCGGGG GAGGCAAGCA CACAGACCAT CAATCTTGAC GACCGGTCGC 
14401 ACTCGGGCGG CGACCTGAAA ACCATCCTGC ATACCAACAT GCJCAAATGTG AACGAGTTCA 
14461 TGTTTACCAA TAAGTTTAAG GCGCGGGTGA TGGTGTCGCG CTTGCCTACT AAGGACAATC 
14521 AGGTGGAGCT GAAATACGAG TGGGTGGAGT TCACGCTGCC CGAGGGCAAC TACTCCGAGA 
14581 CCATGACCAT AGACCTTATG AACAACGCGA TCX3TGGAGCA CTACTTGAAA GTGGGCAGAC 
14641 AGAACGGGGT TCTGGAAAGC GACATCGGGG TAAAGTTTGA CACCCX3CAAC TTCAGACTGG 
14701 GGTTTGACCC CGTCACTGGT CTTGTCATGC CTGGGGTATA TACAAACGAA GCCTTCCATC 
14761 CAGACATCAT TTTCCTGCCA GGATGCGGGG TGGACTTCAC CCACAGCCGC CTGAGCAACT 
14821 TCTTGGGCAT CCGCAAGCGG CAACCCTTCC AGGAGGGCTT TAGGATCACC TACGATGATC 
14881 TGGAGGGTGG TAACATTCCC GCACTGTTGG ATGTGGACGC CTACCAGGCG AGCTTOAAAG 
14941 ATGACACCGA ACAGGGCGGG GGTGGCGCAG GCGGCAGCAA CAGCAGTGGC AGCGGCGCGG 
15001 AAGAGAACTC CAACX3CGGCA GCCX5CGGCAA TGCAGCCGGT GGAGGACATG AACGATCATG 
15061 CXMTCGCGG CGACACCTTT GCCACACGGG CTGAGGAGAA GCGCGCTGAG GCCGAAGCAG 
15121 CGGCCX3AAGC TGCCGCCCCC GCTGCGCAAC CCGAGGTCGA GAAGCCTCAG AAGAAACCGG 
15181 TGATCAAACC CCTGACAGAG GACAGCAAGA AACX5CAGTTA CAACCTAATA AGCAATGACA 
15241 GCACCTTCAC CCAGTACCXSC AGCTGGTACC TTGCATACAA CTAOGGCGAC CCTCAGACCG 
15301 GAATCCX3CTC ATGGACCCTG CTTTGCACTC CTGACGTAAC CTGCGGCTCG GAGCAGOTCT 
15361 ACTGGTCGTT GCCA6ACATG ATGCAAGACC CCXnXSACCTT CCX3CTCCACG CX3CCAGATCA 
15421 GCAACTTTCC GGTGGTGGGC GCCGAGCTGT TGCCCGTGCA CTCCAAGAGC TTCTACAACG 
15481 ACCAGGCCGT CTACTCCCAA CTCATCCX5CC AGTTTACCTC TCTGACCCAC GTGTTCAATC 
15541 GCTTTCCCGA GAACCAGATT TTGGCGCGCC CGCCAGCCCC CACCATCACC ACCGTCAGT6 
15601 AAAACGTTCC TGCTCTCACA GATCACGGGA CGCTACCGCT GCGCAACAGC ATCGGAGGAG 
15661 TCCAGCGAGT GACCATTACT GACGCCAGAC GCCX3CACCTG CCCCTACQTT TACAAGGCCC 
15721 TGGGCATAGT CTC6CCGCGC GTCCTATOOA GCCGCACTTT TTGAGCAAGC ATGTCCATCC 
15781 TTATATCGCC CAGCAATAAC ACAGGCTGGG GCCTGCGCTT CCCAAOCAAG ATGTTTGGCX3 
15841 GGGCCAAGAA GCGCTCOGAC CAACACCCAG TGCGCGTGCG CGGGCACTAC CGCOCGCCCT 
15901 GGGGCGCGCA CAMiCQCCGC CGCACTGGGC GCACCACCGT CGATGACX5CC ATOGAOGCGG 
15961 TCGTGGAGGA GGCX3CGCAAC TACACGCCCA CGCCGCCACC AGTGTCCACA GTGGACX5CX5G 
16021 CCATTCAGAC GGTGGTGGGC GGAGCCCGGC GCTATGCTAA AATGAAGAGA CGGCGGAGGC 
16081 GCX5TAGCA00 TCGCCACXWC CGCXX3ACCCG GCACTGCCX3C CCAACX3C6CX3 GCGGCGGCCX: 
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16141 TGCTTAACCG CGCACGTCX3C ACCGGCCX3AC GGGCGGCCAT GCGGGCCGCT CGAAGGCTGG 
16201 CCGCGGGTAT TGTCACTGTG CCCCCCAGGT CCAGGCGACG AGCGGCCGCC GCAGCAGCCG 
16261 CGGCCATTAG TGCTATGACT CAGGGTCGCA GGGGCAACGT GTATTGGGTG CGCGACTCGG 
16321 TTAGCGGCCT GCGCGTGCCC GTCCGCACCC GCCCCCCX3CG CAACTAGATT GCAAGAAAAA 
16381 ACTACTTAGA CTCGTACTGT TGTATGTATC CAGCGGCGGC GGCGCGCAAC GAAGCTATGT 
16441 CCAAGCGCAA AATCAAAGAA GAGATGCTCC AGGTCATCGC GCCGGAGATC TATGGCCCCC 
16501 CGAAGAAGGA AGAGCAGGAT TACAAGCCCC GAAAGCTAAA GCGGGTCAAA AAGAAAAAGA 
16561 AAGATGATGA TGATGAACTT GACGACGAGG TGGAACTGCT GCACGCTACC GCGCCCAGGC 
16621 GACGGGTACA GTGGAAAGGT CGACGCGTAA AACGTGTTTT GCGACCCGGC ACCACCGTAG 
16681 TCTTTACGCC CGGTGAGCGC TCCACCCGCA CCTACAAGCG CGTGTATGAT GAGGTGTACG 
16741 GCGACGAGGA CCTGCTTGAG CAGGCCAACG AGCGCCTCGG GGAGTTTGCC TACGGAAAGC 
16801 GGCATAAGGA CATGCTGGCG TTGCCGCTGG ACGAGGGCAA CCCAACACCT AGCCTAAAGC 
16861 CCGTAACACT GCAGCAGGTG CTGCCCGCGC TTGCACCGTC CGAAQAAAAG CGCGGCCTAA 
16921 AGCGCGAGTC TGGTGACTTG GCACCCACCG TGCAGCTGAT GGTACCCAAO CGCCAGCGAC 
16981 TGGAAGATGT CTTGGAAAAA ATGACCGTGG AACCTGGGCT GGAGCCCGAG GTCCGCGTGC 
17041 GGCCAATCAA GCAGGTGGCG CCGGGACTGG GCGTGCAGAC CGTGGACGTT CAGATACCCA 
17101 CTACXa^GTAG CACCAGTATT GCCACCGCCA CAGAGGGCAT GGAGACACAA ACGTCCCCGG 
17161 TTCCCTCAGC GGTGGCXK3AT GCXIGCGGTGC AGGCXIGTCGC TGCGGCCX3CG TCCAAGACCT 
17221 CTACGGAGGT GCAAAOGGAC CCGTGGATGT TTCGCGTTTC AGCCCCCCGG CGCCCGCX3CG 
17281 GTTCX3AGGAA GTAOGGCGCC GCCAGCGCGC TACTGCCCGA ATATGCCCTA CATCCTTCCA 
17341 TTGCGCCTAC CCCCGGCTAT CGTGGCTACA CCTACCGCCC CAGAAGACGA GCAACTACCC 
17401 GACGCCGAAC CACCACTGGA ACCCGCCGCC GCCGTCGCCG TCGCCAGCCC GTGCTGGCCC 
17461 CGATTTCCGT GCGCAGGGTG GCTCGCGAAG GAGGCAGGAC CCTGGTGCTG CCAACAGCGC 
17521 GCTACCACCC CAGCATCX3TT TAAAAGCCGG TCTTTGTGGT TCTTGCAGAT ATGGCCCTCA 
17581 CCTGCCGCCT CCGTTTCCCG GTGCCGGGAT TCCGAGGAAG AATGCACCGT AGGAGGGGCA 
17641 TGGCCGGCCA CGGCCTGAOS GGCGGCATGC GTCGTGCGCA CCACCGGCGG CGGCGCGCGT 
17701 CGCACCGTCG CATGCGC6GC GGTATCCTGC CCCTCCTTAT TCCACTGATC GCCGCGGCGA 
17761 TTGGCGCCGT GCCCGGAATT GCATCCGTGG CCTTGCAGGC GCAGAGACAC TGATTAAAAA 
17821 CAAGTTGCAT GTGGAAAAAT CAAAATAAAA AGTCTGGACT CTCACGCTCG CTTGGTCCTG 
17881 TAACTATTTT GTAGAATGGA AGACATCAAC TTTGCGTCTC TGGCCCCGCG ACACGGCTCG 
17941 CQCCCGTTCA TGGGAAACTG GCAAGATATC GGCACCAGCA ATATGAGCGG TGGCGCCTTC 
18001 AGCTGGGGCT CGCTCTGGAG CX5GCATTAAA AATTTCGGTT CCACCGTTAA GAACTATGGC 
18061 AGCAAGGCCT GQAACAGCAG CACAGGCCAG ATGCTGAGGG ATAAGTTGAA AGAGCAAAAT 
18121 TTCCAACAAA AGGTGGTAGA TGGCCTGGCC TCTGGCATTA GCGGGGTGGT GGACCTGGCC 
18181 AACCAGGCAG TGCAAAATAA GATTAACAGT AAGCTTGATC CCCGCCCTCC CGTAGAGGAG 
18241 CCTCCACCGG CCGTGGAGAC AGTGTCTCCA GAGGGGCGTG GCGAAAAGCG TCCX5CGCCCC 
18301 GACAGGGAAG ATACTCTGGT GACX3CAAATA GACX3AGCCTC CCTCGTACGA GGAGGCACTA 
18361 AAGCAAGGCC TGCCCACCAC CCGTCCCATC GCGCCCATGG CTACCGGAGT GCTGGGCCAG 
18421 CACACACCCG TAACGCTGGA CCTGCCTCCC CCCGCCGACA CCCAGCAGAA ACCTGTGCTG 
18481 CCAGGCCCGA CCX3CCGTTGT TGTAACCCGT CCTAGCCGCG CGTCCCTGCG CCGCGCCX3CC 
18541 AGCX;GTCCX3C GATCGTTGC6 GCCCGTAGCC AGTGGCAACT GGCAAAGCAC ACTGAACA6C 
18601 ATCX3TGGGTC TGGGGGTGCA ATCCCTQAAG CGCCGACX3AT GCTTCTGAAT AGCTAACX5TG 
18661 TCGTATGTGT GTCATGTATG CGTCCATGTC GCCX3CCAGAG GAGCTGCPGA GC06CCGCGC 
18721 GCCCGCTTTC CAAGATGGCT ACXTCCTTCGA TGATGCCGCA GTGGTCTTAC ATGCACATCT 
18781 CX3GGCCAGQA CGCCTCGGAG TACCTGAGCC CCGGGCTGQT GCAOTTTGCC CGOGCCACCG 
18841 AGACGTACTT CAGCCTGAAT AACAAGTTTA GAAACCCCAC GGTGGCGCCT ACGCACQACX3 
18901 TGACCACAGA CCGGTCCCAG CX5TTTGACGC TGCGGTTCAT CCCTGTGGAC CGTGAGGATA 
18961 CTGCGTACTC GTACAAGGCG CGGTTCACCC TAGCTGTGGG TGATAACCGT GTGCTGGACA 
19021 TGGCTTCCAC GTACTTTOAC ATCCX3CGQCG TGCTGGACAG GGGCCCTACT TTTAAOCCCT 
19081 ACTCTGGCAC TGCCTACAAC GCXICTGGCrC CCAAGGGTGC CCCAAATCCT TGOOAATGGG 
19141 ATGAAGCTGC TACTGCTCTT GAAATAAACC TAOAAGAAGA GGACGATGAC AAOOAAGACG 
19201 AAGTAGACGA GCAAGCTGAG CAGCAAAAAA CTCACGTATT TGGGCAGGCG CCTTATTCTG 
19261 GTATAAATAT TACAAAGGAG GGTATTCAAA TAGGTGTCGA AGGTCAAACA CCTAAATATG 
19321 CCGATAAAAC ATTTCAACCT GAACCTCAAA TAGGAGAATC TCAGTGGTAC GAAACTGAAA 
19381 TTAATCATGC AGCTGGGAGA GTCCTTAAAA AGACTACCCC AATGAAACCA TGTTACGGTT 
19441 CATATGCAAA ACCCACAAAT GAAAATGGAG GGCAAGGCAT TCTTGTAAAG CAACAAAATG 
19501 OAAAGCIAGA AAGTCAAOTG GAAATOCAAT TTTTCTCAAC TACTOAGGCX} ACOGCAGGCA 
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19561 ATGGTGATAA CTTGACTCCT AAAGTGGTAT TGTACAGTGA AGATGTAGAT ATAGAAACCC 
19621 CAGACACTCA TATTTCTTAC ATGCCCACTA TTAAGGAAGG TAACTCACGA GAACTAATGG 
19681 GCCAACAATC TATGCCCAAC AGGCCTAATT ACATTGCTTT TAGGGACAAT TTTATTGGTC 
19741 TAATGTATTA CAACAGCACG GGTAATATGG GTGTTCTGGC GGGCCAAGCA TCGCAGTTGA 
19801 ATGCTGTTGT AGATTTGCAA GACAGAAACA CAGAGCTTTC ATACCAGCTT TTGCTTGATT 
19861 CCATTGGTGA TAGAACCAGG TACTTTTCTA TGTGGAATCA GGCTGTTGAC AGCTATGATC 
19921 CAGATCTTAG AATTATTGAA AATCATGGAA CTGAAGATGA ACTTCCAAAT TACTGCTTTC 
19981 CACTGGGAGG TGTGATTAAT ACAGAGACTC TTACCAAGGT AAAACCTAAA ACAGGTCAGG 
20041 AAAATGGATG GGAAAAAGAT GCTACyVGAAT TTTCAGATAA AAATGAAATA AGAGTTGGAA 
20101 ATAATTTTGC CATGGAAATC AATCTAAATG CCAACCTGTG GAGA/U^TTTC CTGTACTCCA 
20161 ACATAGCGCT GTATTTGCCC GACAAGCTAA AGTACAGTCC TTCCAACGTA AAAATTTCTG 
20221 ATAACCCAAA CACCTACGAC TACATGAACA AGCGAGTGGT GGCTCCCGGG TTAGTGGACT 
20281 GCTACATTAA CCTTGGAGCA CGCTGGTCCC TTGACTATAT GGACAACGTC AACCCATTTA 
20341 ACCACCACCG CAATGCTGGC CTGCGCTACC GCTCAATGTT GCTGGGCAAT GGTCGCTATG 
20401 TGCCCTTCCA CATCCAGGTG CCTCAGAAGT TCTTTGCCAT TAAAAACCTC CTTCTCCTGC 
20461 CGGGCTCATA CACCTACGAG TGGAACTTCA GGAAGGATGT TAACATGGTT CTGCAGAGCT 
20521 CCCTAGGAAA TGACCTAAGG GTTGACGGAG CCAGCATTAA GTTTGATAGC ATTTGCCTTT 
20581 ACGCCACCTT CTTCCCXa^TG GCCCACAACA CCX5CCTCCAC GCTTGAGGCC ATGCTTAGAA 
20641 ACGACACCAA CGACCAGTCC TTTAACGACT ATCTCTCCGC CGCCAACATG CTCTACCCTA 
20701 TACCCGCCAA CGCTACCAAC GTGCCCATAT CCATCCCCTC CCXSCAACTGG GCGGCTTTCC 
20761 GCGGCTGGGC CTTCACGCGC CTTAAGACTA AGGAAACCCC ATCACTGGGC TCGGGCTACG 
20821 ACCCTTATTA CACCTACTCT GGCTCTATAC CCTACCTAGA TGGAACCTTT TACCTCAACC 
20881 ACACCTTTAA GAAGGTGGCC ATTACCTTTG ACTCTTCTGT CAGCTGGCCT GGCAATGACC 
20941 GCCTGCTTAC CCCCAACGAG TTTGAAATTA AGCGCTCAGT TGACGGGGAG GGTTACAAOG 
21001 TTGCCCAGTG TAACATGACC AAAQACTGGT TCCTGGTACA AATGCTAGCT AACTACAACA 
21061 TTGGCTACCA GGGCTTCTAT ATCCCAGAGA GCTACAAGGA CCGCATGTAC TCCTTCTTTA 
21121 GAAACTTCCA GCCCATGAGC CGTCAGGTGG TGGATGATAC TT^TACAAG GACTACCAAC 
21181 AGGTGGGCAT CCTACACCAA CACAACAACT CTGGATTTGT TGGCTACCTT GCCCCCACCA 
21241 TGCGCGAAGG ACAGGCCTAC CCTGCTAACT TCCCCTATCC GCTTATAGGC AAGACCGCAG 
21301 TTGACAGCAT TACCCAGAAA AAGTTTCTTT GCGATCGCAC CCTTTGGCGC ATCCCATTCT 
21361 CCAGTAACTT TATGTCCATG GGCGCACTCA CAGACCTGGG CCAAAACCTT CTCTACGCCA 
21421 ACTCCGCCCA CGCGCTAGAC ATGACTTTTG AGGTGGATCC CATGGACGAG CCCACCCTTC 
21481 TTTATGTTTT GTTTGAAGTC TTTGACGTGG TCCX3TGTGCA CCGGCCGCAC CGCX3GCGTCA 
21541 TCGAAACCGT GTACCTGCGC ACGCCCTTCT CGGCCGGCAA CGCCACAACA TAAAGAAGCA 
21601 AGCAACATCA ACAACAGCTG CCGCCATGGG CTCCAGTGAG CAGGAACTGA AAGCCATTGT 
21661 CAAAGATCTT GGTTGTGGGC CATATTTTTT GGGCACCTAT GACAAGCGCT TTCCAGGCTT 
21721 TGTTTCTCCA CACAAGCTCG CCTGCGCCAT AGTCAATACG GCCGGTCGCG AGACTGGGGG 
21781 CGTACACTGG ATGGCCTTTG CCTGGAACCC GCACTCAAAA ACATGCTACC TCTTTGAGCC 
21841 CTTTGGCTTT TCTGACCAGC GACTCAAGCA GGTTTACCAG TTTGAGTACG AGTCACTCCT 
21901 GCGCCGTAGC GCCATTGCTT CTTCCCCCGA CCGCTGTATA ACGCTGGAAA AGTCCACCCA 
21961 AAGCGTACAG GGGCCCAACT CGGCCGCCTG TGGACTATTC TGCTGCATGT TTCTCCACGC 
22021 CTTTGCCAAC TGGCCCCAAA CTCCCATGGA TCACAACCCC ACCATGAACC TTATTACOGG 
22081 GGTACCCAAC TCCATGCTCA ACAGTCCCCA GGTACAGCCC ACCCTGCGTC GCAACCAGOA 
22141 ACAGCTCTAC AGCTTCCTGG AGCX3CCACTC GCCCTACTTC CGCAGCCACA GTGOGCAGAT 
22201 TAGGAGCGCC ACTTCTTTTT GTCACTTGAA AAACATGTAA AAATAATGTA CTAGAGACAC 
22261 TTTCAATAAA GGCAAATGCT TTTATTTGTA CACTCTCGGG TGATTATTTA CCCCCACCCT 
22321 TGCXX3TCTGC GCOGTTTAAA AATCAAAGGG GTTCTGCCGC GCATOGCTAT GCGCCACTGG 
22381 CAGGGACACG TTGCGATACT GGTGTTTAGT GCTCCACTTA AACTCAGGCA C AACC ATCCG 
22441 CGGCAGCTCG GTGAAGTTTT CACTCCACAG GCTGCX3CACC ATCACCAACG CGTTTAGCAO 
22501 GTCGGGCGCC GATATCTTGA AGTCGCAGTT GGGGCCTCCG CCCTGCGCXIC GCQAGTTGCG 
22561 ATACACAGGG ITGCAGCACT GQAACACTAT CAGCGCCGGG TGGTGCACGC TGGCCAGCAC 
22621 GCTCTTQTCG GAGATCAGAT CXX3CGTCCAG GTCCTCCGCX5 TTGCTCAGGG CGAACGGAGT 
22681 CAACrrTGGT AGCTGCCTTC CCAAAAAGGG C6CGT0CCCA GGCTTTGAGT TGCACTCGCA 
22741 CCGTAGTGGC ATCAAAAGGT GACCGTGCXrC GGTCTGGGCG TTAGGATACA GCGCCTGCAT 
22801 AAAAGCCTTG ATCTGCTTAA AAGCCACCTG AGCCTTTGCG CCTTCAGAGA AGA ACATGC C 
22861 GCAAQACTTG CCGGAAAACT GATTGGCCGO ACAGGCCGCG TCGTOCACX3C AGCACCTTGC 
22921 OTCGGTGTTO GAGATCTGCA CCACATTTCG GCCCCACCGG TTCTTCACGA TCTTGGCCTT 
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22981 GCTAGACTGC TCCTTCAGCG CGCGCTGCCC GTTTTCGCTC GTCACATCCA TTTCAATCAC 
23041 GTGCTCCTTA TTTATCATAA TGCTTCCGTG TAGACACTTA AGCTCGCCTT CGATCTCAGC 
23101 GCAGCGGTGC AGCCACAACG CGCAGCCCGT GGGCTCGTGA TGCTTGTAGG TCACCTCTGC 
23161 AAACGACTGC AGGTACGCCT GCAGGAATCG CCCCATCATC GTCACAAAGG TCTTGTTGCT 
23221 GGTGAAGGTC AGCTGCAACC CGCGGTGCTC CTCGTTCAGC CAGGTCTTGC ATACGGCCGC 
23281 CAGAGCTTCC ACTTGGTCAG GCAGTAGTTT GAAGTTCGCC TTTAGATCGT TATCCACGTG 
23341 GTACTTOTCC ATCAGCGCGC GCGCAGCCTC CATGCCCTTC TCCCACX3CAG ACACGATCGG 
23401 CACACTCAGC OGGTTCATCA CCGTAATTTC ACTTTCCGCT TCGCTGGGCT CTTCCTCTTC 
23461 CTCTTGCGTC CGCATACCAC GCGCCACTGG GTCGTCTTCA TTCAGCCX5CC GCACTGTGCG 
23521 CTTACCTCCT TTGCCATGCT TGATTAGCAC CGGTGGGTTG CTGAAACCCA CCATTTGTAG 
23581 CGCCACATCT TCTCTTTCTT CCTCGCTGTC CACGATTACC TCTGGTGATG 6CGGGCGCTC 
23641 GGGCTTGGGA GAAGGGCGCT TCTTTTTCTT CTTGGGCGCA ATGGCCAAAT CCGCCGCCGA 
23701 GGTCGATGGC CGCGGGCTGG GTGTGCGCGG CACCAGCGCG TCTTGTGATG AGTCTTCCTC 
23761 GTCCTCGGAC TCGATACGCC GCCTCATCCG CTTTTTTGGG GGCX5CCCX3GG GAGGCGGCGG 
23821 CGACGGGGAC GGGQACGACA CX5TCCTCCAT GGTTGGGGGA CGTCGCQCCG CACCGCGTCC 
23881 GCGCTCGGGG GTGGTTTCGC GCTGCTCCTC TTCCCGACTG GCCATTTCCT TCTCCTATAG 
23941 GCAGAAAAAG ATCATGGAGT CAGTCGAGAA GAAGGACAGC CTAACCGCCC CCTCTGAOTT 
24001 CGCCACCACC GCCTCCACCG ATGCCGCCAA CGCGCCTACC A CCTTCC CCG TCGAGGCACC 
24061 CCCGCTTGAG GAGGAGGAAG TGATTATCX3A GCAGGACCCA GGTTTTGTAA GCGAAGACGA 
24121 CGAGGACCGC TCAGTACCAA CAGAGGATAA AAAGCAAGAC CAGGACAACG CAGAGGCAAA 
24181 CGAGGAACAA GTCGGGCGGG GGGACGAAAG GCATGGCGAC TACCTAGATG TGGGAGACGA 
24241 CGTGCTGTTG AAGCATCTGC AGCGCCAGTG CGCCATTATC TOCGACX3CGT TGCAAGAGCG 
24301 CAGCGATGTG CCCCTCGCCA TAGCGGATCT CAGCCTTGCC TACX3AACGCC ACCTATTCTC 
24361 ACCGCGCGTA CCCCCCAAAC GCCAAQAAAA CGGCACATGC GAGCCCAACC CGCGCCTCAA 
24421 CTTCTACCCC GTATTTGCCG TGCCAGAGGT GCTTGCCACC TATCACATCT TTTTCCAAAA 
24481 CTGCAAGATA CCCCTATCCT GCCGTGCCAA CCGCAGCCGA GCGGACAAGC AGCT GGCCT T 
24541 GCGGCAGGGC GCTGTCATAC CTGATATCGC CTCGCTCAAC GAAGTGCCAA AAATCTTTGA 
24601 GGGTCTTGGA CGCGACX5AGA AGCGCGCGGC AAACGCTCTG CAACAGGAAA ACAGCX5AAAA 
24661 TGAAAGTCAC TCTGGAGTGT TGGTGGAACT CGAGGGTGAC AACX3CX5CGCC TAGCCGTACT 
24721 AAAACGCAGC ATCGAGGTCA CCCACTTTGC CTACCCGGCA CTTAACCTAC CCCCCAAGGT 
24781 CATGAGCACA GTCATGAGTG AGCTGATCGT GCGCCGTGCG CAGCCCCTGG AGAGGGATGC 
24841 AAATTTGCAA GAACAAACAG AGGAGGGCCT ACCCGCAGTT GGCGACQAGC AGCTAGCX5CG 
24901 CTGGCTTCAA ACGCX3CGAGC CTGCCGACTT GGAGGAGCGA CGCAAACTAA TGATGGCCGC 
24961 AGTGCTCGTT ACCGTGGAGC TTGAGTGCAT GCAGCGGTTC TTTGCTGACC CGGAGATGCA 
25021 GCGCAAGCTA GAGGAAACAT TGCACTACAC CTTTCGACAG GGCTACGTAC GCCAGGCCTG 
25081 CAAGATCTCC AACGTGGAGC TCTGCAACCT GGTCTCCTAC CTTGGAATTT TGCACX3AAAA 
25141 CCGCCTTGGG CAAAACGTGC TTCATTCCAC GCTCAAGGGC GAGGCGCGCC GC GACTA CGT 
25201 CCGCGACTGC GTTTACTTAT TTCTATGCTA CACCTGOCAG ACXK3CCATGG GCGTTTGGCA 
25261 GCAGTGCTTG GAGGAGTGCA ACCTCAAGGA GCTGCAGAAA CTGCTAAAGC AAAACT TGAA 
25321 GGACCTATGG ACGGCCTTCA ACGAGCGCTC CGTGGCCGCG CACCTGGOGG ACATCATTTT 
25381 CCCCX3AACGC CTGCTTAAAA CCCTGCAACA GGGTCTGCCA GACTTCACCA GTCAAAGCAT 
25441 GTTGCAGAAC TTTAGGAACT TTATCCTAGA GCGCTCAGQA ATCTTGCCCG CCACCTGCTG 
25501 TGCACTTCCT AGCGACTTTG TGCOCATTAA GTACCGCX3AA TGCCCTCCGC CX5CTTTGGGG 
25561 CCACTGCTAC CTTCTGCAGC TAGCCAACTA CCTTGCCTAC CACTCTGACA TAATGGAAGA 
25621 CX3T0AGCGGT OACOQTCTAC TGGAGTGTCA CTGTCGCTGC AACCTATGCA CCCCGCACCG 
25681 CTCCCTGGTT TGCAATTCGC AGCTGCTTAA CGAAAGTCAA ATTATCGOTA CCTTTQAGCT 
25741 GCAGGGTCCC TCGCXTTGACXS AAAAGTCXIGC GGCTCCGGGG TTGAAACTCA CTCCGGGGCT 
25801 GTGGACXyrCG GCTTACCTTC GCAAATTTGT ACCTGAGGAC TACCACX5CCC ACGAGATTAG 
25861 GTTCTACGAA GACCAATCCC GCXCGCCAAA TGCGGAGCTT ACCGCCTGCXS TCATTACCCA 
25921 GGGCCACATT CTTGGCCAAT TGCAAGCCAT CAACAAAGCC CGCCAAGAGT TTCTGCTACG 
25981 AAAC3GGACGG GGGGTTTACT TGGACCCCCA GTCCX3GCGAG GAGCTCAACC CAATCCCCXrC 
26041 GCCGCOGCAG CCCTATCAOC AOCAGCCGCXS GGCCCTTGCT TCCCAGGATQ GCACCCAAAA 
26101 AGAAGCTGCA GCTGCCGCCO CCACCCACGO ACGAGGAGGA ATACTGGOAC AGTCAGGCAG 
26161 AGGAGGTrrr GGACGAGGAG GAGGAGGACA TGATGGAAGA CTGGGAGAGC CTAOAOGAGG 
26221 AAGCTTCCGA GGTCGAAGAG GTGTCAQACX3 AAACACCGTC ACCCTCGGTC GCATTCCCCT 
26281 CGCCGGCX5CC CCAGAAATCG GCAACCGGTT CCAGCATGGC TACAACCTCC GCTCCTCAGG 
26341 CX5CCGCCGGC ACTGCCCGTT CGCCGACCCA ACCGTAGATG GGACACCACT GGAACCAGGG 
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26401 CCGGTAAGTC CAAGCAGCCG CCGCCGTTAG CCCAAGAGCA 
2 64 61 GCTCATGGCG CGGGCACAAG AACGCCATAG r XU Cr m C lT 
26521 TCTCCTTCGC CCGCCGCTTT CTTCTCTACC ATCACGGCGT 
26581 TGCATTACTA CCGTCATCTC TACAGCCCAT ACTGCACCGG 
26641 ACAGCAGCGG CCACACAGAA GCAAAGGCGA CCGGATAGCA 
26701 AAATCCACAG CGGCGGCAGC AGCAGGAGGA GGAGCGCTGC 
26761 GTATCGACCC GCGAGCTTAG AAACAGGATT TTTCCCACTC 
26821 AGCAGGGGCC AAGAACAAGA GCTGAAAATA AAAAACAGGT 
26881 AGCTGCCTGT ATCACAAAAG CGAAGATCAG CTTCGGCGCA 
26941 CTCTTCAGTA AATACTGCGC GCT6ACTCTT AAGGACTAGT 
27001 TAAGCGCGAA AACTACGTCA TCTCCAGCGG CCACACCCGG 
27061 GCCATTATGA GCAAGGAAAT TCCCACGCCC TACATGTGGA 
27121 CTTGCGGCTG GAGCTGCCCA AGACTACTCA ACCCGAATAA 
27181 CACATGATAT CCCGGGTCAA CGGAATCCGC GCCCACCGAA 
27241 GCGGCTATTA CCACCACACC TCGTAATAAC CTTAATCCCX: 
27301 GTGTACCAGG AAAGTCCCX3C TCCCACCACT GTGGTACTTC 
27361 GTTCAGATGA CTAACTCAGG GGCGCAGCTT GCGGGCX5GCT 
27421 CCCGGGCAGG GTATAACTCA CCTGACAATC AGAGGGCGAG 
27481 TCGGTGAGCT CCTCGCTTGG TCTCCGTCCG GACGGGACAT 
27541 CGTCCTTCAT TCACGCCTCG TCAGGCAATC CTAACTCTGC 
27601 CGCTCTGGAG GCATTGGAAC TCTGCAATTT ATTGAGGAGT 
27661 AACCCCTTCT CGGGACCTCC CGGCCACTAT CCGGATCAAT 
27721 GTAAAGGACT CGGCGGACGG CTACGACTGA TAATTAAGTG 
27781 CTGAAACACC TGGTCCACTG TCGCCGCCAC AAGTGCTTTG 
27841 TGCTACTTTG AATTGCCCGA GGATCATATC GAGGATCTTT 
27901 TATAATAAAT ACAGAAATTA AAATATACTG GGGCTCCTAT 
27961 CGTCTTCACC CGCCCAAGCA AACCaUVGGCG AACCTTACCT 
28021 CTCTGTGATT TACAACAGTT TCAACCCAOA CGGAGTGAGT 
28081 GCTCAGCTAC TCCATCAGAA AAAACACCAC CCTCCTTACC 
28141 TTAAAAGTCA GGCTTCCTGG ATGTCAGCAT CTGACTTTGG 
28201 TTGTTCCAGT CCAACTACAG CGACCCACCC TAACAGAGAT 
28261 CCGCCGCTAC CGGACTTACA TCTACCACAA ATACACCCCA 
28321 ACTGGGATAA CTTGGGCATG TGGTGGTTCT CCATAGCGCT 
28381 TTATGTGGCT CATCTGCTGC CTAAAGCGCA AACX5CGCCCG 
28441 TCATTGTGCT ACACCCAAAC AATGATGGAA TCCATAGATT 
28501 TCTTTTCTCT TACAGTATGA TTAAATGAGA TTAATTAAGG 
28561 AGCAGCACCT CCTTGCCCTC CTCCCAGCTC TGGTATTGCA 
28621 TTTCTCCACA ATCTAAATGG AATGTCAGTT TCCTCCTGTT 
28681 ATCTTCATGT TGTTGCAGAT GAAGCGCGCA AGACCGTCTG 
28741 TATCCATATG ACACGGAAAC CGGTCCTCCA ACTGTGCCTT 
26801 TCCCCCAATG GGTTTCAAGA GAGTCCCCCT GGGGTACTCT 
28861 CTAGTTACCT CCAATGGCAT GCTTGCGCTC AAAATGGGCA 
28921 GCCX3GCAACC TTACCTCCCA AAATGTAACC ACTGTGA6CC 
28981 TCAAACATAA ACCTGGAAAT ATCTGCACCC CTCACAGTTA 
29041 GCTGCCGCCG CACCTCTAAT GGTCGCGGGC AACACACTCA 
29101 CTAACCGTGC ACGACTCCAA ACTTAGCATT GCCACCCAAG 
29161 GGAAAGCTAG CXTCTGCAAAC ATCAGGCCCC CTCACCACCA 
29221 ATCACTGCCT CACCCCCTCT AACTACTGCC ACTGGTAGCT 
29281 CCCATTTATA CACAAAATGG AAAACTAGGA CTAAAGTACG 
29341 GAC6ACCTAA ACACTTTCSAC CQTA6CAACT GGTCCAGGTG 
29401 TT6CAAACTA AAGTTACTGG AGCCTTGGGT TTTGATTCAC 
29461 GTAGCAGGAG GACTAAOGAT TQATTCTCAA AACAGACGCC 
29521 CCGTTTGATG CTCAAAACCA ACTAAATCTA AGACTAGGAC 
29581 TCAGCCCACA ACTTGGATAT TAACTACAAC AAAGGCCTTT 
29641 AATTCCAAAA AGCTTGAGGT TAACCXAAGC ACTGCCAAGG 
29701 GCCATAGCCA TTAATGCAGG AGATGGGCTT GAATTTGGTT 
29761 AATCCCCTCA AAACAAAAAT TGGCCATGGC CTAGAATTTG 



ACAACAGCGC CAAGGCTACC 
GCAAGACTGT GGGGGCAACA 
GGCCTTCCCC CGTAACATCC 
CGGCAGCX3GC AGCGGCAGCA 
AGACTCTGAC AAAGCCCAAG 
GTCTGGCGCC CAACGAACCC 
TGTATGCTAT ATTTCAACAG 
CTCTGCGATC CCTCACCCGC 
CGCTGGAAGA CGCGGAGGCT 
TTCGCGCCCT TTCTCAAATT 
CGCCAGCACC TGTCGTCAGC 
GrrACCAGCC ACAAATGGGA 
ACTACATGAG CX3CGGGACCC 
ACCGAATTCT CTTGGAACAG 
GTAGTTGGCC CGCTGCCCTG 
CCAGAGACGC CCAGGCCX^AA 
TTCGTCACAG GGTGCGGTCG 
GTATTCAGCT CAACGACGAG 
TTCAGATCGG CGGCGCCX3GC 
AGACCTCGTC CTCTGAGCCG 
TTGTGCCATC GGTCTACTTT 
TTATTCCTAA CTTTGACGCG 
GAGAGGCAGA GCAACT6CGC 
CCCGCGACTC CGGTGAGTTT 
GTTGCCATCT CTGTGCTGAG 
CGCCATCCTG TAAACGCCAC 
GGTACTTTTA ACATCTCTCC 
CTACGAGAGA ACCTCTCCGA 
TGCCGGGAAC GTACCCTTAA 
CCAGCACCTG TCCCGCGGAT 
GACCAACACA ACCAACGCGG 
AGTTTCTGCC TTTGTCAATA 
TATGTTTGTA TGCCTTATTA 
ACCACCCATC TATAGTCCCA 
GGACGGACTG AAACACATGT 
AATTTCTGTC CAGTTTATTC 
GCTTCCTCCT GGCTGCAAAC 
CCTGTCCATC CGCACCCACT 
AAGATACCTT CAACCCCGTG 
TTCTTACTCC TCCCTTTGTA 
CTTTGCGCCT ATCCGAACCT 
ACGGCCTCTC TCTGGACGAG 
CACCTCTCAA AAAAACCAAG 
CCTCAGAAGC CCTAACTOTG 
CCATGCAATC ACAGGCCCCG 
GACCCCTCAC AGTGTCAGAA 
CCGATAGCAG TACCCTTACT 
TGGQCATTGA CTTGAAAGAG 
GGGCTCCTTT GCAT6TAACA 
TQACTATTAA TAATACTTCC 
AAGOCAATAT GCAACTTAAT 
TTATACTTGA TOTTAGTTAT 
AGGGCCCTCT TTTTATAAAC 
ACTTGTTTAC AGCTTCAAAC 
GGTTGAT6TT TGACGCTACA 
CACCTAATGC ACCAAACACA 
ATTCAAACAA GGCXATGGTT 
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29821 CCTAAACTAG GAACTGGCCT TAGnTTGAC AGCACAGGTG CCATTACAGT AGGAAACAAA 
29881 AATAATGATA AGCTAACTTT GTGGACCACA CCAGCTCCAT CTCCTAACTG TAGACTAAAT 
29941 GCAGAGAAAG ATGCTAAACT CACTTTGGTC TTAACAAAAT GTGGCAGTCA AATACTTGCT 
30001 ACAGTTTCAG TTTTGGCTGT TAAAGGCAGT TTGGCTCCAA TATCTGGAAC AGTrCAAAGT 
30061 GCTCATCTTA TTATAAGATT TQACGAAAAT GGAGTGCTAC TAAACAATTC CTTCCTGGAC 
30121 CCAGAATATT GGAACTTTAG AAATGGAGAT CTTACTGAAG GCACAGCCTA TACAAACGCT 
3 0181 GTTGGATTTA TGCCTAACCT ATCAGCTTAT CXZAAAATCTC ACGGTAAAAC TGCCAAAAGT 
30241 AACATTGTCA GTCAAGTTTA CTTAAACGGA GACAAAACTA AACCTGTAAC ACTAACCATT 
30301 ACACTAAACG GTACACAGGA AACAGGAGAC ACAACTCCAA GTGCATACTC TATGTCATTT 
30361 TCATGGGACT GGTCTGGCCA CAACTACATT AATGAAATAT TTGCCACATC CTCTTACACT 
30421 TTTTCATACA TTGCCCAAGA ATAAAGAATC GTTTGTGTTA TGITTCAACG TGTTTATTTT 
30481 TCAATTGCAG AAAATTTCAA GTCATTTTTC ATTCAGTAGT ATAGCCCCAC CACCACATAG 
30541 CTTATACAGA TCACCGTACC TTAATCAAAC TCACAGAACC CTAGTATTCA ACCTGCCACC 
30601 TCCCTCCCAA CACACAGAGT ACACAGTCCT TTCTCCCCGG CTGGCCTTAA AAAGCATCAT 
30661 ATCATGGGTA ACAGACATAT TCTTAGGTGT TATATTCCAC ACGGTITCCT GTCGAGCCAA 
30721 ACGCTCATCA GTGATATTAA TAAACTCCCC GGGCAGCTCA CTTAAGTTCA TGTCGCTGTC 
30781 CAGCTGCTGA GCCACAGGCT GCTGTCCAAC TTGCGGTTGC TTAACX3GGCG GCGAAGGAGA 
30841 AGTCCACGCC TACATGGGGG TAGAGTCATA ATCGTGCATC AGGATAGGGC GGTGGTGCTG 
30901 CAGCAGCGCG CGAATAAACT GCTGCCGCCG CCGCTCCGTC CTGCAGGAAT ACAACATGGC 
30961 AGTGGTCTCC TCAGCGATGA TTCGCACCX5C CCGCAGCATA AGGCGCCTTG TCCTCCGGGC 
31021 ACAGCAGCGC ACCCTGATCT CACTTAAATC AGCACAGTAA CTGCAGCACA GCACCACAAT 
31081 ATTGTTCAAA ATCCCACAGT GCAAGGCGCT GTATCCAAAG CTCATGGCGG GGACCACAGA 
31141 ACCCACGTGG CCATCATACC ACAAGCGCAG GTAGATTAAG TGGCGACCCC TCATAAACAC 
31201 GCTGGACATA AACATTACCT CmTGGCAT GTTGTAATTC ACCACCTCCC GGTACCATAT 
31261 AAACCTCTGA TTAAACATGG CGCCATCCAC CACCATCCTA AACCAGCTGG CCAAAACCTG 
31321 CCCGCCGGCT ATACACTGCA GGGAACCGGG ACTGGAACAA TGACAGTGGA GAGCCCAGGA 
31381 CTCGTAACCA TGGATCATCA TGCTCGTCAT GATATCAATG TTCGCACAAC ACAGGCACAC 
31441 GTGCATACAC TTCCTCAGGA TTACAAGCTC CTCCCGCGTT AQAACCATAT CCCAGGGAAC 
31501 AACCCATTCC TGAATCAGCG TAAATCCCAC ACTGCAGGGA AGACCTCGCA CGTAACTCAC 
31561 GTTGTGCATT GTCAAAGTGT TACATTCGGG CAGCAGCGGA TGATCCTCCA GTATGGTAGC 
31621 GCGGGTTTCT GTCTCAAAAG GAGGTAGACG ATCCCTACTG TAOGGAGTGC GCCGAGACAA 
31681 CCGAGATCGT GTTGGTCGTA GTGTCATGCC AAATGGAACG CCGGACGTAG TCATATTTCC 
31741 TGAAGCAAAA CCAGGTGCGG GCX5TGACAAA CAGATCTGCG TCTCCX3GTCT CGCCGCTTAG 
31801 ATCGCTCTGT GTAGTAGTTG TAGTATATCC ACTCTCTCAA AGCATCCAGG CGCCCCCTGG 
31861 CTTCGGGTTC TATGTAAACT CCTTCATGCG CCGCTGCCCT GATAACATCC ACCACCGCAG 
31921 AATAAGCCAC ACCCAGCCAA CCTACACATT CXHTCTGCGA GTCACACACG GGAGGAGCGG 
31981 GAAGAGCTGG AAGAACCATG ' riTrrrriTl ' TATTCCAAAA GATTATCCAA AACCTCAAAA 
32041 TGAAGATCTA TTAAGTGAAC GCGCTCCCCT CCGGTGGCGT GGTCAAACTC TACAGCCAAA 
32101 GAACAGATAA TGGCATTTGT AAGATGTTGC ACAATGGCTT CCAAAAGGCA AACGGCCCTC 
32161 ACGTCCAAGT GGACX5TAAAG GCTAAACCCT TCAGGGTGAA TCTCCTCTAT AAACATTCCA 
32221 GCACCTTCAA CCATGCCCAA ATAATTCTCA TCTCGCCACC TTCTCAATAT ATCTCTAAGC 
32281 AAATCCCGAA TATTAAGTCC GGCCATTGTA AAAATCTGCT CCAGAGCX5CC CTCCACCTTC 
32341 AGCCTCAAGC AGCOAATCAT QATTQCAAAA ATTCAGQTTC CTCACAGACC TGTATAAGAT 
32401 TCAAAAGCGG AACATTAACA AAAATACCGC GATCCCGTAG GTCCCTTCGC AGGGCCAGCT 
32461 GAACATAATC GTGCAGGTCT GCACGGACCA GCGCGGCCAC TTCCCXXKrCA GGAACCTTGA 
32521 CAAAAGAACC CACACTGATT ATGACACGCA TACTCGGAGC TATGCTAACC AGCGTAGCCC 
32581 CGATGTAAGC TTTGTTGCAT GGGCGGCGAT ATAAAATGCA AGGTGCTGCT CAAAAAATCA 
32641 GGCAAAGCCT CGCX3CAAAAA AGAAAGCACA TCGTAGTCAT GCTCAT6CAG ATAAAGGCAG 
32701 GTAAGCTCCG GAACCACCAC AGAAAAAGAC ACCATTTTTC TCTCAAACAT GTCTGCX5GGT 
32761 TTCTGCATAA ACACAAAATA AAATAACAAA AAAACATTTA AACATTAGAA GCCTGTCTTA 
32821 GAACAGGAAA AACAACCCTT ATAAGCATAA GACGGACTAC GGCCATGCCG GCGTGACCX3T 
32881 AAAAAAACTG GTCACXXTPGA TTAAAAAGCA CCACOGACAG CTCCTCGGTC ATGTCCGGAG 
32941 TCATAATGTA AGACTCGGTA AACACATCAG GTTOATTCAT CGOTCAGTGC TAAAAAOCOA 
33001 CCQAAATAGC CCGGGGGAAT ACATACCCGC AGGCGTAGAG ACAACATTAC AGCCCCCATA 
33061 GGAGGTATAA CAAAATTAAT AGGAGAGAAA AACACATAAA CACCTGAAAA ACCCTCCTGC 
33X21 CTAGGCAAAA TAGCACCCTC CCGCTCCAGA ACAACATACA GCGCTTCACA GCGGCAGCCT 
33181 AACAGTCAGC CTTACCAGTA AAAAAGAAAA CCTATTAAAA AAACACCACT CGACACGGCA 
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33241 CCAGCTCAAT CAGTCACAGT GTAAAAAAGG GCCAAGTGCA GAGCGAGTAT ATATAGGACT 
33301 AAAAAATGAC GTAACGGTTA AAGTCCACAA AAAACACCCA GAAAACCX3CA CGCGAACCTA 
33361 CGCCCAGAAA CGAAAGCCAA AAAACCCACA ACTTCCTCAA ATCGTCACTT CCGTTTTCCC 
33421 ACGTTACGTA ACTTCCCATT TTAAGAAAAC TACAATTCCC AACACATACA AGTTACTCCG 
33481 CCCTAAAACC TACGTCACCC GCCCCGTTCC CACGCCCCGC GCCACGTCAC AAACTCCACC 
33541 CCCTCATTAT CATATTGGCT TCAATCCAAA ATAAGGTATA TTATTGATGA TG 
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LOCUS 

DEFINITION 
ACCESSION 
KEYWORDS 
SOURCE 



KD3 
KD3 
KD3 



34341 bp 



DNA 



SYN 



06-FEB-1999 



REFERENCE 
AUTHORS 
JOURNAL 

FEATURES 
CDS 



ORGANISM 



Unknovm. 

Unknown 

Unclassified. 

1 (bases 1 to 34341) 

Self 

Unpublished. 



Location/Qualifiers 
1. .34341 
/genee"KD3" 
/productB"KD3" 



BASE COUNT 
ORIGIN 



7951 a 9671 c 9464 g 



7255 t 



1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TQATAATGAG GGGGTGGAGT 
61 TTGTGACGTO GCGCGGGGCG TGGGAACGGG GCGGGTGACG TAGTAGTGTG GCGGAAGTGT 
121 GATGITGCAA GTGTGGCGGA ACACATGTAA GCGACGGATG TGGCAAAAGT GACGTTTTTG 
181 GTGTGCGCCG GTGTACACAG GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG 
241 TAAATTTGGG CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 
301 AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA GGGCCGCGGG 
361 GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT CTCAGGTGTT TTCCGCGTTC 
421 CGGGTCAAAG TTGGCGTTTT ATTATTATAG TCAGCTGACG TGTAGTGTAT TTATACCCGG 
481 TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCCGC 
541 TCCGACACCG GGACTGAAAA TGAGACATGA GGTACTGGCT GATAATCTTC CACCTCCTAG 
601 CCATTTTGAA CCACCTACCC TTCACGAACT GTATGATTTA GACGTGACGG CCCCCGAAGA 
661 TCCCAACGAG GAGGCGGTTT CGCAGATTTT TCCCGACTCT GTAATGTTGG CGGTGCAGGA 
721 AGGGATTGAC TTACTCACTT TTCCGCOGGC GCCCGGTTCT CCGGAGCCGC CTCACCTTTC 
781 CCGGCAGCCC GAGCAGCCGG AGCAGAGAGC CTTGGGTCCG GTTTGCCACG AGGCTGGCTT 
841 TCCACCCAGT GACGACGAGG ATGAAGAGGG TGAGGAGTTT . GTGTTAGATT ATGTGGAGCA 
901 CCCCGGGCAC GGTTGCAGGT CTTGTCATTA TCACCGGAGG AATACGGGGG ACCCAGATAT 
961 TATGTGTTCG CTTTGCTATA TGAGGACCTG TGGCATGTTT GTCTACAGTA AGTGAAAATT 
1021 ATGGGCAGTG GGTGATAGAG TGGTGGGTTT GGTGTGGTAA TTTTTTTTTT AATTTTTACA 
1081 GTTTTGTGGT TTAAAGAATT TTGTATTGTG ATTTTTTTAA AAGGTCCTGT GTCTGAACCT 
1141 GAGCCTGAGC CCGAGCCAGA ACCGGAGCCT GCAAGACCTA CCCGCCGTCC TAAAATGGCG 
1201 CCTGCTATCC TGAGACGCCC GACATCACCT GTGTCTAGAG AATGCAATAG TAGTACGGAT 
1261 AGCTGTGACT CCGGTCCTTC TAACACACCT CCTGA6ATAC ACCCGGTGGT CCCGCTGTGC 
1321 CCCATTAAAC CAGTTGCCGT GAGAGTTGGT GGGCGTCGCC AGGCTGTGGA ATGTATCGAG 
1381 GACTTGCTTA ACGAGCCTGG GCAACCTTTG GACTTGAGCT GTAAACOCCC CAGGCCATAA 
1441 GGTCTAAACC TGTGATTGCG TGTGTGGTTA ACGCCTTTGT TTGCTGAATG AGTTGATGTA 
1501 AGTTTAATAA AGGGTGAGAT AATGTTTAAC TTGCATGGCG TGTTAAATGG GGCGGGGCTT 
1561 AAAGGGTATA TAATGCGCCG TGGGCTAATC TTGGTTACAT CTGACCTCAT GGAGGCTTGG 
1621 GAGTGTTTGG AAGATTTTTC TGCTGTGCGT AACTTGCTGG AACAGAGCTC TAACAGTACC 
1681 TCTTCGTTTT GGAGGXTTCT GTGGGGCTCA TCCCAGGCAA AGTTAGTCTG CAGAATTAAG 
1741 GAGGATTACA AGTGGGAATT TGAAGAGCTT TTGAAATCCT GTGGTG AGCT GT TTGATTC T 
1801 TTGAATCTGG GTCACCAGGC GCTTTTCCAA GAGAAGGTCA TCAAGACTTT GGATTTTTCC 
1861 ACACCGGGGC GCGCTGCGGC TGCTGTTGCT TTTTTQAGTT TTATAAAGGA TAAATGGAGC 
1921 GAAGAAACCC ATCTGAGCGG GGGGTACCTG CtGGATTTTC TGGCCATGCA TCTGTGGAGA 
1981 GCGGTTGTGA GACACAAGAA TCGCCTGCTA CTGTTGTCTT CCGTCCGCCC GGCGATAATA 
2041 CCGACGGAGG AGCAGCAGCA GCAGCAGGAG GAAGCCAGGC GGCGGCGGCA GGAGCAGAQC 
2101 CCATGGAACC CGAGAGCCGG CCTGGACCCT CGGGAATGAA TGTTGTACAG GTGGCTGAAC 
2161 TGTATCCAGA ACTGAGACGC ATTTTGACAA TTACAGAGGA TGGGCAGGGG C TAAAGG GGG 
2221 TAAAOAGGGA GOGGGGGGCT TGTGAGGCTA CAGAGGAGGC TAGGAATCTA GCTTTTAGCT 
2281 TAATOACCAG ACACCGTCCT GAOTOTATTA CTTTTCAACA QATCAAGGAT AATTGCGCTA 
2341 ATGAGCTTGA TCTGCTGGCG CAQAAGTATT CCATAGA6CA GCTGACCACT TACTGGCTGC 
2401 AGCCAGGGGA TGATTTTOAG GAGGCTATTA GGGTATATGC AAAGGTGGCA CTTAGGCCAG 
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2461 ATTGCAAGTA CAAGATCAGC AAACTTGTAA ATATCAGGAA TTGTTGCTAC ATTTCTGGGA 
2521 ACGGGGCCGA GGTGGAGATA GATACGGAGG ATAGGGTGGC CTTTAGATGT AG CATGA TAA 
2581 ATATGTGGCC GGGGGTGCTT GGCATGGACG GGGTGGTTAT TATGAATGTA AGGTTTACTG 
2641 GCCCCAATTT TAGCXK5TACG GTTTTCCTGG CCAATACCAA CCTTATCCTA CACGGTGTAA 
2701 GCTTCTATGG GTTTAACAAT ACCTGTGTGG AAGCCTGGAC CGATGTAAGG GTTCGGGGCT 
2761 GTGCCTTTTA CTGCTGCTGG AAGGGGGTGG TGTGTCGCCC CAAAAGCAGG GCTTCAATTA 
2821 AGAAATGCCT CTTTGAAAGG TGTACCTTGG GTATCCTGTC TGAGGGTAAC TCCAGGGTGC 
2881 GCCACAATGT GGCCTCCGAC TGTGGTTGCT TCATGCTAGT GAAAAGCGTG GCTGTGATTA 
2941 AGCATAACAT GGTATGTGGC AACTGCGAGG ACAGGGCCTC TCAGATGCTG ACCTGCTCGG 
3001 ACGGCAACTG TCACCTGCTG AAGACCATTC ACGTAGCCAG CCACTCTCGC AAGGCCTGGC 
3061 CAGTGTTTGA GCATAACATA CTGACCCGCT GTTCCTTGCA TTTGGGTAAC AGGAGGGGGG 
3121 TGTTCCTACC TTACCAATGC AATTTGAGTC ACACTAAGAT ATTGCTTGAG CCCGAGAGCA 
3181 TGTCCAAGGT GAACCTGAAC GGGGTGTTTG ACATGACCAT GAAGATCTGG AAGGTGCT6A 
3241 GGTACGATGA GACCCGCACC AGGTGCAGAC CCTGCGAGTG TGGCGGTAAA CATATTAGGA 
3301 ACCAGCCTGT GATGCTGGAT GTGACCGAGG AGCTGAGGCC CGATCACTTG GTGCTGGCCT 
3361 GCACCCGCGC TGAGTTTGGC TCTAGCGATG AAGATACAGA TTGAGGTACT GAAA TGTGTG 
3421 GGCGTCGCTT AAGGGTGGGA AAGAATATAT AAGGTGGGGG TCTTATGTAG TTTTGTATCT 
3481 GTTTTGCAGC AGCCGCCGCC GCCATGAGCA CCAACTCGTT TGATGGAAGC ATTGTGAGCT 
3541 CATATTTGAC AACGCGCATG CCCCCATGGG CCGGGGTGCG TCAGAATGTG ATGGGCTCCA 
3601 GCATTGATGG TCGCCCCGTC CTGCCCGCAA ACTCTACTAC CTTGACCTAC GAGACXX3TGT 
3661 CTGGAACGCC GTTGGAGACT GCAGCCTCCG CCGCCX5CTTC AGCCGCTGCA GCCACX^CCC 
3721 GCX3GGATTGT GACTGACTTT GCTTTCCTGA GCCCGCTTGC AAGCAGTGCA GCTTCCCGTT 
3781 CATCCGCCCG CGATGACAAG TTGACGGCTC TTTTGGCACA ATTG GATTC T TTGACCCGGG 
3841 AACTTAATGT CX3TTTCTCAG CAGCTGTTGG ATCTGCGCCA GCAGGTTTCT GCCCTGAAGG 
3901 CTTCCTCCCC TCCCAATGCG GTTTAAAACA TAAATAAAAA ACCAGACTCT GTTTGGATTT 
3961 GGATCAAGCA AGTGTCTTGC TGTCTTTATT TAGGGGTTTT GCGCGCGCGG TAGGCCCGGG 
4021 ACCAGCGGTC TCGGTCGTTG AGGGTCCTGT GTATTTTTTC CAGGACGTGG TAAAGGTGAC 
4081 TCTGGATGTT CAGATACATG GGCATAAGCC CGTCTCTGGG GTGGAGGTAG CACCACTGCA 
4141 GAGCTTCATG CTGCGGGGTG GTGTTGTAGA TGATCCAGTC GTAGCAGGAG CGCTGGGCX3T 
4201 GGTGCCTAAA AATGTCTTTC AGTAGCAAGC TGATTGCCAG GGGCAGGCXTC TTGGTGTAAG 
4261 TGTTTACAAA GCGGTTAAGC TGGGATGGGT GCATACGTGG GGATATGAGA TGCATCTTGG 
4321 ACTGTATTTT TAGGTTGGCT ATGTTCCCAG CCATATCCCT CCGGGGATTC ATGTTGTGCA 
4381 GAACCACCAG CACAGTGTAT CCGGTGCACT TGGGAAATTT GTCATGTAGC TTAGAAGGAA 
4441 ATGCGTGGAA GAACTTGGAG ACGCCCTTGT GACCTCCAAG ATTTTCCATG CATTCGTCCA 
4501 TAATGATGGC AATGGGCCCA CGGGCGGCGG CCTGGGCGAA GATATTTCTG GGATCACTAA 
4561 CGTCATAGTT GTGTTCCAGG ATGAGATCGT CATAGGCCAT TTTTACAAAG CQCX3GG0GGA 
4621 GGGTGCCAGA CTGCGGTATA ATGGTTCCAT CCGGCCCAGG GGCGTAGTTA CCCTCACAGA 
4681 TTTGCATTTC CCACGCTTTG AGTTCAGATG GGGGGATCAT GTCTACCTGC GGGGCGATGA 
4741 AGAAAACGGT TTCCGGGGTA GGGGAGATCA GCTGGGAAGA AAGCAGGTTC CTGAGCAGCT 
4801 GCGACTTACC GCAGCCGGTG GGCCCGTAAA TCACACCTAT TACCGGGTGC AACTGGTAGT 
4861 TAAGAGAGCT GCAGCTGCCG TCATCCCTGA GCAGGGGGGC CACTTCGTTA AGCATGTCCC 
4921 TGACTCGCAT GTTTTCCXrrG ACCAAATCCG CCAGAAGGCG CTCGCCGCCC AGCGAT AGCA 
4981 GTTCTTGCAA GGAAGCAAAG TTTTTCAACG GTTTGAGACC GTCCGCCGTA GGCATGCTTT 
5041 TCAGCX3TTTG ACCAAGCAGT TCCAGGCGGT CCCACAGCTC GGTCA CCTGC TCTACXK3CAT 
5101 CTCGATCCAG CATATCTCCT CXSTTTCGCXSG GTTGGGGCGG CTTTCGCTGT ACGGCAGTAG 
5161 TCGGTGCTCG TCCAGACGGG CCAGGGTCAT GTCTTTCCAC GGGCGCAGGG TCCTCGTCA6 
5221 CGTAGTCTGG GTCACGGTGA AGGGGTGCGC TCCGGGCTGC GCGCTGGCCA GGGTGCGCTT 
5281 GAGGCTGGTC CTGCTGGTGC TGAAGCGCTQ CCGGTCTTCG CCCTGCXSCGT CGGCCAGGTA 
5341 GCATTTOACC ATGGTGTCAT AGTCCAGCCC CTCCGCGGCG TGGCCCTTGG CGCGCJ^GCTT 
5401 GCCCTTGOAG GAGGCGCCGC ACGAGGGGCA GTGCAGACTT TTGAGGGCGT AGAGCTTGGG 
5461 COCGAGAAAT ACCGATTCCG GGGAGTAGGC ATCCGCGCCG CAGGCCCCGC AGACGGTCTC 
5521 GCATTCCACG AGCCAGGTGA GCTCTGGC?CO TTCGGGGTCA AAAACCAGGT TTCCCCCATQ 
5581 crrTTTGATG CGTTTCTTAC CTCTGGTTTC CATGAGCCGG TGTCCAOGCT CGGTGACGAA 
5641 AAGGCTGTCC GTGTCCCCX3T ATACAGACTT GAGAGGCCTG TCCTCGAGCG GTGTTCCGCG 
5701 GTCCTCCTCG TATAGAAACT CGGACCACTC TGAGACAAAG GCTCGCGTCC AGGCCAGCAC 
5761 GAAGGAGGCT AAGTGGGAGG GGTAGCGGTC GTTGTCCACT AGGGGQTCCA CTCGCTCCAG 
5821 GGTGTOAAGA CACATGTCGC CCTCTTCGGC ATCAAGGAAG GTGATTGGTT TGTAGGTGTA 
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5881 GGCCACGTGA CCGGGTGTTC CTGAAGGGGG GCTATAAAAG GGGGTGGGGG CGCGTTCGTC 
5941 CTCACTCTCT TCCGCATCGC TGTCTGCGAG GGCCAGCTGT TGGGGTGAGT ACTCXTCTCTG 
eOOl AAAAGCGGGC ATGACTTCTG CGCTAAGATT GTCAGTTTCC AAAAACGAGG AGGATTTGAT 
6061 ATTCACCTGG CCCGCGGTGA TGCCTTTGAG GGTGGCCGCA TCCATCTGGT CAGAAAAGAC 
6121 AATCTTTTTG TTGTCAAGCT TGGTGGCAAA CGACCCGTAG AGGGCGTTGG ACAGCAACTT 
6181 GGCGATGGAG CGCAGGGTTT GGTTTTTGTC GCGATCGGCG CGCTCCTTGG COGOGATGTT 
6241 TAGCTGCACG TATTCGCGCG CAACX3CACCG CCATTCXXWA AAGACGGTGG TGCGCTCX3TC 
6301 GGGCACCAGG TGCACGCGCC AACCGCGGTT GTGCAGGOTG ACAAGOTCAA CGCTGGTGGC 
6361 TACCTCTCCG CGTAGGCGCT CGTTGGTCCA GCAGAGGCGG CCX3CCCTTGC GCGAGCAGAA 
6421 TGGCGGTAGG GGGTCTAGCT GCX5TCTCGTC CGGGGGGTCT GCGTCCACGG TAAAGACCCC 
6481 GGGCAGCAGG CGCGCGTCGA AGTAGTCTAT CTTGCATCCT TGCAAGTCTA GCGCCTGCTG 
6541 CCATGCGCGG GCGGCAAGCG CGCGCTCGTA TGGGTTGAGT GGGGGACCCC ATGGCATGGG 
6601 GTGGGTGAGC GCGGAGGCGT ACATGCCGCA AATGTCX3TAA ACGTAGAGGG GCTCTCTGAG 
6661 TATTCCAAGA TATGTAGGGT AGCATCTTCC ACCGCGGATG CTGGCGCGCA CGTAATCGTA 
6721 TAGTTCGTGC GAOGGAGCX5A GGAGGTCGGG ACCGAGGTTG CTAOGGGCGG GCTGCTCTGC 
6781 TCGGAAGACT ATCTGCCTGA AGATGGCATG TGAGTTGGAT GATATGGTTG GACGCTGGAA 
6841 GACX3TTGAAG CTGGCGTCTG TGAGACCTAC CGCGTCACGC ACGAAGGAGG CGTAGGAGTC 
6901 GCGCAGCTTG TTGACCAGCT CGGCGGTGAC CTGCACGTCT AGGGCGCAGT AGTCCAGGGT 
6961 TTCCTTGATG ATGTCATACT TATCCTGTCC CITTTTTTTC CACAGCTCGC GGTTGAGGAC 
7021 AAACTCTTCG CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 
7081 AGAGCCTAGC ATGTAGAACT GGTTGACGGC CTGGTAGGCG CAGCATCCCT TTTCTACGGG 
7141 TAGCX3CGTAT GCCTGCGCGG CCTTCCGGAG CGAGGTGTGG GTGAGCGCAA AGGT6TCCCT 
7201 GACCATGACT TTGAGGTACT GGTATTTGAA GTCAGTGTCG TCGCATCCGC CCTGCTCCCA 
7261 GAGCAAAAAG TCCGTGCGCT TTTTGGAACG CGGATTTGGC AGGGCGAAGG TGACATCGTT 
7321 GAAGAGTATC TTTCCCGCGC GAGGCATAAA GTTGCGTGTG ATGCGGAAGG GTCCCGGCAC 
7381 CTCGGAACGG TTGTTAATTA CCTGGGCGGC GAGCACGATC TCGTCAAAGC CGTTGATGTT 
7441 GTGGCCCACA ATGTAAAGTT CCAAGAAGCG CGGGATGCCC TTGATGGAAG GCAATTTTTT 
7501 AAGTTCCTCG TAGGTGAGCT CTTCAGGGGA GCTGAGCCCG TGCTCTGAAA GGGCCCAGTC 
7561 TGCAAGATGA GGGTTGGAAG CGACGAATGA GCTCCACAGG TCACGGGCCA TTAGCATTTG 
7621 CAGGTGGTCG CGAAAGGTCC TAAACTGGCG ACCTATGGCC ATTTTTTCTG GGGTGATGCA 
7681 GTAGAAGGTA AGCGGGTCTT GTTCCCAGCG GTCCCATCCA AGGTTCGCGG CTAGQTCTCG 
7741 CGCGGCAGTC ACTAGAGGCT CATCTCCGCC GAACTTCATG ACCAGCATGA AGGGCACOAG 
7801 CTGCTTCCCA AAGGCCCCCA TCCAAGTATA GGTCTCTACA TCGTAGGTGA CAAAGAQACG 
7861 CTCGGTGCGA GGATGCGAGC CGATCGGGAA GAACTGGATC TCCCGCCACC AATTGGAGGA 
7921 GTGGCTATTG ATGTGGTGAA AGTAGAAGTC CCTGCGACGG GCCGAACACT CGTGCTGGCT 
7981 TTTGTAAAAA CGTGCGCAGT ACTGGCAGCG GTGCACGGGC TGTACATCCT GCACGAGGTT 
8041 GACCTGACGA CCX5CGCACAA GGAAGCAGAG TGGGAATTTG AGCCCCTCGC CTGGCGGGTT 
8101 TGGCTGGTGG TCTTCTACTT CGGCTGCTTG TCCTTGACCG TCTGGCTGCT CGAGGGGAGT 
8161 TACGGTGGAT CGGACCACCA CGCCGCGCGA GCCCAAAGTC CAGATGTCCG CGCGCGGCGG 
8221 TCX3K5AGCTTG ATGACAACAT CGCGCAGATG GGAGCTGTCC ATGGTCTGGA GCTCCCGCGG 
8281 CX3TCAGGTCA GGCX5GGAGCT CCTGCAGGTT TACCTCGCAT AGACGGGTCA GGGCGCGGGC 
8341 TAGATCCAGG TGATACCTAA TTTCCAGGGG CTGGTTGGTG GCGGCGTCGA TGGCTTGCAA 
8401 GAGGCCGCAT CCCCGCGGCG CGACTACGGT ACCGCGCGGC GGGCGGTGGG CCX5CGGGGGT 
8461 GTCCTTGGAT GATGCATCTA AAAGCGGTGA CGCGGGCGAG CCCCCGGAGQ TAGGGGGGGC 
8521 TCCGGACCCG CCGGGAGAGG GGGCAGGGGC ACGTCGGCGC CGCGCGCGGG CAGGAGCTGG 
8581 TGCTGCGCGC GTAGGTTGCT GGCGAACGCG ACGACGCGGC GGTTGATCTC CTGAATCTGG 
8641 CGCXrrCTGCG TGAAGAC6AC GGGCCCGGTG AGCTTGAGCC TGAAAGAGA6 TTCX3ACAGAA 
8701 TCAATTTCGG TGTCGTTGAC GGCGGCCTGG CX5CAAAATCT CCTGCAOGTC TCCTGAGTTG 
8761 TCTTGATAGG CGATCTCGGC CATGAACTGC TCGATCTCTT CCTCCTGGAG ATCTCCGOGT 
8821 CCGGCTCGCT CCACX3GTGGC GGCGAGGTCG TTGGAAATGC GGGCCATGAG CTGCGAGAAG 
8881 GCGTTGAGGC CTCCCTCGTT CCAGACGCGG CTGTAGACCA CGCCCCCTTC GGCATCGOGG 
8941 GCX5CX3CATGA CCACCTGCGC GAGATTGAGC TCCACGTGCC GGGCX3AA6AC GGCGTAGTTT 
9001 CGCAGGCGCT GAAAGAGGTA GTTGAGGGTG GTGGCGGTGT GTTCTGCCAC GAAGAAGTAC 
9061 ATAACCCAGC GTCGCAACGT GGATTCGTTG ATATCCCCCA AGGCCTCAAG GCGCTCCATG 
9X21 GCCTCGTAGA AGTCCACXK3C GAAGTTGAAA AACTGGGAGT TGCXSCGCCGA CACGGTTAAC 
9181 TCCTCCTCCA GAAGACXX3AT GAGCTCX3GCG ACAGTGTCGC GCACCTCGCG CTCAAAGGCT 
9241 ACAGGGGCCT CTTCTTCTTC TTCAATCTCC TCTTCCATAA GGGCCTCCCC TTCTTCTTCT 
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9301 TCTGGCGGCG GTGGGGGAGG GGGGACACGG CGGCGACGAC GGCGCACCGG GAGGCGGTCG 
9361 ACAAAGCGCT CGATCATCTC CCCGCGGCX3A CGGCGCATGG TCTCGGTGAC GGCGCGGCCG 
9421 TTCTCGCGGG GGCGCAGTTG GAAGACGCCG CCCGTCATGT CCCGGTTATG GGTTGGCGGG 
9481 GGGCTGCCAT GCGGCAGGGA TACGGCGCTA ACGATGCATC TCAACAATTG TTGTGTAGGT 
9541 ACTCCGCCGC CGAGGGACCT GAGCGAGTCC GCATCGACCG GATCGGAAAA CCTCTCGAGA 
9601 AAGGCGTCTA ACCAGTCACA GTCGCAAGGT AGGCTGAGCA CCGTGGCGGG CGGCAGCGGG 
9661 CGGCGGTCGG GGTTGTTTCT GGCGGAGGTG CTGCTGATGA TGTAATTAAA GTAGGCGGTC 
9721 TTGAGACGGC GGATGGTCGA CAGAAGCACC ATGTCCTTGG GTCCX3G CCTG CTGAATGCGC 
9781 AGGCGGTCGG CCATGCCCCA GOCTTCGTTT TGACATCGGC GCAGGTCTTT GTAG TAGT CT 
9841 TGCATGAGCC TTTCTACCGG CACTTCTTCT TCTCCTTCCT CTTGTCCTGC ATCTCTTGCA 
9901 TCTATCGCTG CGGCGGCGGC GGAGTTTGGC CGTAGGTGGC GCCCTCTTCC TCCCATGCX5T 
9961 GTGACCCCGA AGCCCCTCAT CGGCTGAAGC AGGGCTAGGT CGGCGACAAC GCGCTCGGCT 
10021 AATATGGCCT GCTGCACCTG CGTGAGGGTA GACTGGAAGT CATCCATGTC CACAAA6CX3G 
10081 TGGTATGCGC CCGTGTTGAT GGTGTAAGTG CAGTTGGCCA TAACGGACCA GTTAACGGTC 
10141 TGGTGACCCG GCTGCGAGAG CTCGGTGTAC CTGAGACGCG AGTAAGCCCT CGAGTCAAAT 
10201 ACGTAGTCGT TGCAAGTCCG CACCAGGTAC TGGTATCCCA CCAAAAAGTG CGGCXK3CGGC 
10261 TGGCGGTAGA GGGGCCAGCG TAGGGTGGCC GGGGCTCCX5G GGGCX5AGATC TTCCaACATA 
10321 AGGCGATGAT ATCCGTAGAT GTACCTGGAC ATCCAGGTGA TGCCG6CX3GC GGTGGTGGAG 
10381 GCGCGCGGAA AGTCGCGGAC GCGGTTCCAG ATGTTGCGCA GCGGCAAAAA GTGCTCCATG 
10441 GTCGGGACGC TCTGGCCGGT CAGGCGCGCG CAATCGTTGA CGCTCTAGCG TGCAAAAGGA 
10501 GAGCCTGTAA GCGGGCACTC TTCCGTGGTC TGGTGGATAA ATTCGCAAGG GTATCATGGC 
10561 GGACGACCGG GGTTCGAGCC CCGTATCCGG CCGTCCGCCG TGATCCATGC GGTTACCGCC 
10621 CGCGTGTCGA ACCCAGGTGT GCGACGTCAG ACAACGGGGG AGTGCTCCTT TTGGCTTCCT 
10681 TCCAGGCGCG GCGGCTGCTG CGCTAGCTTT TTTGGCCACT GGCCGCX5CGC AGCGT AAGCG 
10741 GTTAGGCTGG AAAGCGAAAG CATTAAGTGG CTCGCTCCCT GTAGCCGGAG GGTTATTTTC 
10801 CAAGGGTTGA GTCGCGGGAC CCCCX3GTTCG AGTCTCGGAC CGGCCGGACT GCX3GCGAACG 
10861 GGGGTTTGCC TCCCCGTCAT GCAAGACCCC GCTTGCAAAT TCCTCCGGAA ACAGGGACGA 
10921 GCCCCTTTTT TGCTTTTCCC AGATGCATCC GGTGCTGCGG CAGATGCGCC CCCCTCCTCA 
10981 GCAGCGGCAA GAGCAAGAGC AGCGGCAGAC ATGCAGGGCA CCCTCCCCTC CTCCTACCGC 
11041 GTCAGGAGGG GCGACATCCG CX3GTTGACGC GGCAGCAGAT GGTGATTACG AACCCCCGCG 
11101 GCGCCGGGCC CGGCACTACC TGGACTTGGA GGAGGGCGAG GGCCTGGCGC GGCTAGGAGC 
11161 GCCCTCTCCT GAGCGGTACC CAAGGGTGCA GCTGAAGCGT GATA0GCX3TG AGGCGTACGT 
11221 GCCGCGGCAG AACCTGTTTC GCGACCGCGA GGOAGAGGAG CCCGAGGAGA TGCGGGATCG 
11281 AAAGTTCCAC GCAGGGCGCG AGCTGCGGCA TGGCCTGAAT CGCGAGCGGT TGCTGCGCGA 
11341 GGAGGACTTT GAGCCCGACG CGCGAACCGG GATTAGTCCC GCGCGCGCAC ACGTGGCXK3C 
11401 CGCCGACCTG GTAACCGCAT ACGAGCAGAC GGTGAACCAG GAGATTAACT TTCAAAAAAG 
1X461 CTTTAACAAC CACGTGCGTA CGCTTGTGGC GCGCGAGGAG GTGGCTATAG GACTGATGCA 
11521 TCTCTGGGAC TTTGTAAGCG CGCTGGAGCA AAACCCAAAT AGCAAGCCGC TCATGGCGCA 
11581 GCTGTTCCTT ATAGT6CAGC ACAGCAGGGA CAACGAGGCA TTCAGGGATG CGCTGCTAAA 
11641 CATAGTAGAG CCCGAGGGCC GCTGGCTGCT CGATTTGATA AACATCCTGC AGAGCATAGT 
11701 GGTGCAGGAG CGCAGCTTGA GCCTGGCTGA CAAGGTGGCC GCCATCAACT ATTCCATGCT 
11761 TAGCCTGGGC AAGTTTTACG CCCGCAAGAT ATACCATACC CCTTACGTTC CCATAOACAA 
11821 GGAGGTAAAG ATCGAGGGGT TCTACATGCG CATGGCGCTG AAGGTGCTTA CCTTGAGCGA 
11881 CGACCTGGGC GTTTATCXKIA ACGAGCGCAT CCACAAQGCC GTGAGCGTGA GCCGGCGGCG 
11941 CGAGCTCAGC GACC6CGAGC TGATGCACAG CCTGCAAAGG GCCCTGGCTG GCACGGGCAG 
12001 CGGCGATAGA GAGGCCGAGT CCTACTTTGA CGCGGGCGCT GACCTGCGCT GGGCCCCAAG 
12061 CCOACGCGCC CTGGAGGCAG CTGGGGCCX«3 ACCrGGGCTG GCX3GTGGCAC C0G0Q0GCX3C 
12121 TGGCAACX3TC GGCGGCGTGG AGGAATATGA CX3AGGACGAT GAGTACGAGC CAGAOGAOGG 
12181 CGAGTACTAA GCGGTGATGT TTCTGATCAO ATGATGCAAG ACGCAACGGA CCOGGCGGTG 
12241 CGGGCGGCGC TGCAGAGCCA GCCGTCCGGC CTTAACTCCA CGGACGACTG GCGCCAGGTC 
12301 ATGGACCGCA TCATGTCGCT GACTGCGCGC AATCCTGACG CX5TTCCGGCA GCAGCCGCAG 
12361 GCCAACCGGC TCTCCGCAAT TCTGGAAGCX5 GTGGTCCCGG CGCGCGCAAA CCCCACGCAC 
12421 GAGAAGGTGC TCGCGATCX5T AAACGCGCTO OCCGAAAACA GGGCCATCCG GCGCGAGGAG 
12481 GCCGGCCTGG TCTACQACGC GCTGCTTCAG CGCX5TGGCTC GTTACAACAG CGGCAACGTG 
12541 CAGACCAACC TGGACCGGCT GGTGGGGGAT GTGOQCGAGG CCOT0GC6CA GCGTGAGCGC 
12601 GCGCAGCAGC AGGOCAACCT GGGCTCCATG GTPGCACTAA ACGCCTTCCT GAGTACACAG 
12661 CCCGCCAACG TGCCOCGGGO ACAGGAGGAC TACACCAACT TTOTGAOCGC ACTGCGOCTA 
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12721 ATGGTGACTG AGACACCGCA AA6T6AGGTG TACCAGTCTG GGCCAGACTA TTTTTTCCAG 
12781 ACCAGTAGAC AAGGCCTGCA GACCX3TAAAC CTGAGCCAGG CTTTCAAAAA CTTGCAGGGG 
12841 CTGTGGGGGG TGCGGGCTCC CACAGGCGAC CGCGCGACCG TGTCTAGCTT GCTGACGCCC 
12901 AACTCGCGCC TGTTGCTGCT GCTAATAGCG CCCTTCACGG ACAGTGGCAG CGTGTCCCGG 
12961 GACACATACC TAGGTCACTT GCTGACACTG TACCGCX5AGG CCATAGGTCA GGCGCATGTG 
13021 GACGAGCATA CTTTCCAGGA GATTACAAGT GTCAGCCGCG CGCTGGGGCA GGAGGACACG 
13081 GGCAGCCTGG AGGCAACCCT AAACTACCTG CTGACCAACC GGCGGCAGAA GATCCCCTCG 
13141 TXGCACAGTT TAAACAGCGA GGAGGAGCGC ATTTTGCGCT ACGTGCAGCA GAGCGTGAGC 
13201 CTTAACCTGA T6CGCGACGG GGTAACGCCC AGCGTGGCGC TGGACATGAC CGCGCGCAAC 
13261 ATGGAACCGG GCATGTATOC CTCAAACCGG CCX3TTTATCA ACCGCCTAAT GGACTACTTG 
13321 CATCGCGCGG CCGCCGTGAA CCCCGAGTAT TTCACCAATG CCATCTTGAA CCCGCACTGG 
13381 CTACCGCCCC CTGGTTTCTA CACCGGGGGA TTCGAGGTGC CCGAGGGTAA CGATGGATTC 
13441 CTCTGGGACG ACATAGACGA CAGCX3TGTTT TCCCCGCAAC CGCAGACCCT GCTAGAGTTG 
13501 CAACAGCGCG AGCAGGCAGA GGCGGCGCTG CGAAAGGAAA GCTTCCGCAG GCCAAGCAGC 
13561 TTCTCCGATC TAGGCGCTGC GGCCCCGCGG TCAGATGCTA GTAGCCCATT TCCAAGCTTG 
13621 ATAGGGTCTC TTACCAGCAC TCGCACCACC CGCCCGCGCC TGCTGGGCGA GGAGGAGTAC 
13681 CTAAAGAACT 0GCT6CTGCA GCCX3CAGCGC GAAAAAAACC TGCCTCC6GC ATTTCCCAAC 
13741 AACX3GGATAG AGAGCCTAGT GGACAAGATG AGTAGATGGA AQACGTACGC GCAGGAGCAC 
13801 AGGGACX5TGC CAGGCCCGCG CCCGCCCACC CX5TCGTCAAA GOCACGACCG TCAGCGGGGT 
13861 CTGGTGTGGG AGGACGATGA CTCGGCAGAC GACAGCAGCG TCCTGGATTT GGGAGGGAGT 
13921 GGCAACCCGT TTGCGCACCT TCGCCCCAGG CTGGGGAGAA TGTTTTAAAA AAAAAAAAGC 
13981 ATGATGCAAA ATAAAAAACT CACCAAGGCC ATGGCACCGA GCGTTGGTTT TCTTGTATTC 
14041 CCCTTAGTAT GCGGCGCX5CG GCGATGTATG AGGAAGGTCC TCCTCCCTCC TACGAGAGTG 
14101 TGGTGAGCGC GGCGCCAGTG GCGGCGGCGC TGGGTTCTCC CTTCGATGCT CCCCTGGACC 
14161 CGCCGTTTGT GCCTCCGCGG TACCTGCGGC CTACCGGGGG GAGAAACAGC ATCCX3TTACT 
14221 CTGAGTTGGC ACCCCTATTC GACACXyiCCC GTGTGTACCT GQTGQACAAC AAGTCAACGG 
14281 ATGTGGCATC CCTGAACTAC CAGAACGACC ACAGCAACTT TCTGACCACG GTCATTCAAA 
14341 ACAATGACTA CAGCCCGGGG GAGGCAAGCA CACAGACCAT CAATCTTGAC GACCGGTCGC 
14401 ACTGGGGCGG CGACCTGAAA ACCATCCTGC ATACCAACAT GCCAAATGTG AACGAGTTCA 
14461 TGTTTACCAA TAAGTTTAAG GCGCGGGTGA TGGTGTCGCG CTTGCCTACT AAGGACAATC 
14521 AGGTGGAGCT GAAATACGAG TGGGTGGAGT TCACGCTGCC CGAGGGCAAC TACTCCGAGA 
14581 CCATGACCAT AGACCTTATG AACAACX5CGA TCGTGGAGCA CTACTTGAAA GTGGGCAGAC 
14641 AGAACGGGGT TCTGGAAAGC GACATCGGGG TAAAGTTTGA CACCCGCAAC TTCAGACTGG 
14701 GGTTTGACCC CGTCACTGGT CTTGTCATGC CTGGGGTATA TACAAACGAA GCCTTCCATC 
14761 CAGACATCAT TTTGCTGCCA GGATGCGGGG TGGACTTCAC CCACAGCCGC CTGAGCAACT 
14821 TGTTGGGCAT CCGCAAGCGG CAACCCTTCC AGGAGGGCTT TAGGATCACC TACGATGATC 
14881 TGGAGGGTGG TAACATTCCC GCACTGTTGG ATGTGGACGC CTACCAGGCG AGCTTGAAAG 
14941 ATGACACCGA ACAGGGCX3GG GGTGGCGCAG GCGGCAGCAA CAGCAGTGGC AGCGGCGCGG 
15001 AAGAGAACTC CAACGCGGCA GCCGCGGCAA TGCAGCCGGT GGAGGACATG AACGATCATG 
15061 CCATTCGCGG CGACACCTTT GCCACACGGG CTGAGGAGAA GCGCX5CTGAG GCCGAAGCAG 
15121 CGGCCGAAGC TGCCGCCXTCC GCTGC6CAAC CCGAGGTCGA GAAGCCTCAG AAGAAACXXK3 
15181 TGATCAAACC CCTGACAGAG GACAGCAAGA AACGCAGTTA CAACCTAATA AGCAATGACA 
15241 GCACCTTCAC CCAGTACCGC AGCTGGTACC TTGCATACAA CTACXK3GGAC CCTCAGACCG 
15301 GAATCCGCTC ATGGACCCTG CTTTGCACTC CTGACQTAAC CTGCOGCTOG GAGCAGGTCT 
15361 ACTGGTCGTT GCCAGACATG ATGCAAGACC CCGTGACCTT CCGCTCCACG CGCCAGATCA 
15421 GCAACITTCC GGTGGTGGGC GCCGAGCTGT TGCCCGTGCA CTCCAAGAGC TTCTACAACG 
15481 ACCAGGCCGT CTACTCCCAA CTCATCCGCC AGTTTACCTC TCTGACCCAC GTGTTCAATC 
15541 GCTTTCCCGA GAACCAGATT TTGGCGCGCC CGCCAGCCCC CACCATCACC ACCGTCAGTG 
15601 AAAACGTTCC TGCTCTCACA GATCACGGGA CGCTACCX5CT GCGCAACAGC ATCGGAGGAG 
15661 TCCAGCGAGT GACCATTACT GAC6CCAGAC GCCGCACCTG CCCCTACGTT TACAAGGCCC 
15721 TGGGCATAGT CTCGCCGCGC GTCCTATCGA GCCX5CACTTT TTGAGGAAGC ATGTCCATCC 
15781 TTATATCGCC CAGCAATAAC ACAGGCTGGG GCCTGCGCTT CCCAAGCAAG ATGTTTGGCG 
15841 GGGCCAAGAA GCGCTCCGAC CAACACCCAG TGCGCGTGCG CGGGCACTAC CGCGCGCCCT 
15901 GGGGCGCGCA CAAACGCGGC CGCACTGGGC GCACCACCX3T CGATGACGCC ATCGACGCGG 
15961 TGGTGGAGGA GGCGCGCAAC TACACGCCCA CGCCGCCACC AGTGTCCACA GTGGACGCX3G 
16021 CCATTCAGAC CX3TGGTGCGC GGAGCCCGGC GCTATGCTAA AATGAAGAGA CGGCGGAGGC 
16081 GCGTAGCACG TCGCCACCGC CGCCGACCCG GCACTGCCGC CCAACGCOCG GCX3GCGGCCC 
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X6141 TGCTTAACCG CX5CACGTCGC ACCGGCCGAC 
16201 CCGCGGGTAT TGTCACTGTG CCCCCCAGGT 
16261 CGGCCATTAG TGCTATGACT CAGGGTCGCA 
16321 TTAGCGGCCT GCGCGTGCCC GTGCGCACCC 
16381 ACTACTTAGA CTCGTACTGT TGTATGTATC 
16441 CCAAGCGCAA AATCAAAGAA GAGATGCTCC 
16501 CGAAGAAGGA AGAGCAGGAT TACAAGCCCC 
16561 AAGATGATGA TGATGAACTT GACGACGAGG 
16621 GACGGGTACA GTGGAAAGGT CGACGCGTAA 
16681 TCTTTACGCC CGGTGAGCX5C TCCACCCX5CA 
16741 GCGACGAGGA CCTGCTTGAG CAGGCCAACG 
16801 GGCATAAGGA CATGCTGGCG TTGCCGCTGG 
16861 CCGTAACACT GCAGCAGGTG CTGCCCGCGC 
16921 AGCGCGAGTC TGGTGACTTG GCACCCACCG 
16981 TGGAAGATGT CTTGGAAAAA ATGACCGTGG 
17041 GGCCAATCAA GCAGGTGGCG CCGGGACTGG 
17101 CTACCAGTAG CACCAGTATT GCCACCGCCA 
17161 TTGCCTCAGC GGTGGOGGAT GCCGOGGTGC 
17221 CTACGGAGGT GCAAACX3GAC CCGTGGATGT 
17281 GTTCGAGGAA GTACGGCGCC GCCAGCGCX5C 
17341 TTGCGCCTAC CCCCGGCTAT CGTGGCTACA 
17401 GACGCCGAAC CACCACTGGA ACCCGCCGCC 
17461 CGATTTCCGT GCGCAGGGTG GCTCGCGAAG 
17S21 GCTACCACCC CAGCATCGTT TAAAAGCCX3G 
17581 CCTGCCGCCT CCGTTTCCCG GTGCOGGGAT 
17641 TGGCCGGCCA CGGCCTGACG GGCGGCATGC 
17701 CGCACCGTCG CATGCGCGGC GGTATCCTGC 
17761 TTGGCGCCGT GCCCGGAATT GCATCCX3TGG 
17821 CAAGTTGCAT GTGGAAAAAT CAAAATAAAA 
17881 TAACTATTTT GTAGAATGGA AGACATCAAC 
17941 CGCCCGTTCA TGGGAAACTG GCAAGATATC 
18001 AGCTGGGGCT CGCTGTGGAG CGGCATTAAA 
18061 AGCAAGGCCT GGAACAGCAG CACAGGCCAG 
18121 TTCCAACAAA AGGTGGTAGA TGGCCTGGCC 
18181 AACCAGGCAG TGCAAAATAA GATTAACAGT 
18241 CXrrCCACCGG CCGTGGAGAC AGTGTCTCCA 
18301 GACAGGGAAG AAACTCTGGT GACGCAAATA 
18361 AAGCAAGGCC TGCCCACCAC CCGTCCCATC 
18421 CACACACCCG TAACGCTGGA CCTGCCTCCC 
18481 CCAGGCX:CGA CCGCCGTTGT TGTAACCCGT 
18541 AGCGGTCCGC GATCGTTGCG GCCCX3TAGCC 
18601 ATCGTGGGTC TGGGGGTGCA ATCCCTGAAG 
18661 TCGTATGTGT GTCATGTATG CGTCCATGTC 
18721 GCCCGCTTTC CAAGATGGCT ACCCCTTCGA 
18781 CGGGCCAGGA CGCCTCGGAG TACCTGAGCC 
18841 AGACGTACTT CAGCCTGAAT AACAAGTTTA 
18901 TGACCACAGA CCGGTCCCAG CGTTTGACGC 
18961 CTGCX3TACTC GTACAAGGCG CGGTTCACCC 
19021 TGGCTTCCAC GTACTTTQAC ATC0GCX3GCG 
19081 ACTCTGGCAC TGCCTACAAC GCCCTGGCTC 
19141 ATGAAGCTGC TACTGCTCTT GAAATAAACC 
19201 AAGTAGACGA GCAAGCTGAG CAGCAAAAAA 
19261 GTATAAATAT TACAAAGGAG GGTATTCAAA 
19321 CCGATAAAAC ATTTCAACCT GAACCTCAAA 
19381 TTAATCATGC AGCTGGGAGA GTCCTTAAAA 
19441 CATATGCAAA ACCCACAAAT GAAAATGGAG 
19501 GAAAGCTAGA AAGTCAAGTG GAAATGCAAT 



GGGCX3GCCAT GCGGGCCGCT CGAAGGCTGG 
CCAGGCGACG AGCGGCCGCC GCAGCAGCCG 
GGGGCAACGT GTATTGGGTG CGCGACTCGG 
GCCCCCCGCG CAACTAGATT GCAAGAAAAA 
CAGCGGCGGC GGCGCGCAAC GAAGCTATGT 
AGGTCATCGC GCCX^GAGATC TATGGCCCCC 
GAAAGCTAAA GCGGGTCAAA AAGAAAAAGA 
TGGAACTGCT GCACGCTACC GCGCCCAGGC 
AACGTGTTTT GCGACCCGGC ACCACCGTAG 
CCTACAAGCG CGTGTATGAT GAGGTGTACX3 
AGCGCCTCGG GGAGTTTGCC TACGGAAAGC 
ACGAGGGCAA CCCAACACCT AGCCTAAAGC 
TTGCACCGTC CGAAGAAAAG CGCGGCCTAA 
TGCAGCTGAT GGTACCCAAG CGCCAGCGAC 
AACCTGGGCT GGAGCCCGAG GTCCGCGTGC 
GCGTGCAGAC CX3TGGACGTT CAGATACCCA 
CAGAGGGCAT GGAGACACAA AOGTCCCCGG 
AGGC6GTCGC TGCGGCCGCG TCCAAGACCT 
TTCGCGTTTC AGCCCCCCGG CGCCCGCGCG 
TACTGCCCGA ATATGCCCTA CATCCTTCCA 
CCTACCGCCC CAGAAGACGA GCAACTACCC 
GCCGTCGCCG TCGCCAGCCC GTGCTGGCCC 
GAGGCAGGAC CCTGGTGCTG CCAACAGCGC 
TCTTTGTGGT TCTTGCAGAT ATGGCCCTCA 
TCCGAGGAAG AATGCACCGT AGGAGGGGCA 
GTCGTGCGCA CCACCGGOGG CGGCX3CGCGT 
CCCTCCTTAT TCCACTGATC GCCGCGGCGA 
CCTTGCAGGC GCAGAGACAC TGATTAAAAA 
AGTCTGGACT CTCACX5CTCG CTTGGTCCTG 
TTTGCGTCTC TGGCCCCGCG ACACGGCTCG 
GGCACCAGCA ATATGAGCGG TGGCGCCTTC 
AATTTCGGTT CCACCGTTAA GAACTATGGC 
ATGCTGAGGG ATAAGTT6AA AGAGCAAAAT 
TCTGGCATTA GCGGGGTGGT GGACCTGGCC 
AAGCTTGATC CCCX5CCCTCC CGTAGAGGAG 
GAGGGGCGTG GCGAAAAGCG TCCGCGCCCC 
GACGAGCCTC CCTCGTACGA GGAGGCACTA 
GCGCCCATGG CTACCGGAGT GCTGGGCCAG 
CCCGCCGACA CCCAGCAGAA ACCTGTGCTG 
CCTA6CCGCQ CGTCCCTGCG CCGCGCCGCC 
AGTGGCAACT GGCAAAGCAC ACT6AACAGC 
CX3CCGACGAT GCTTCTGAAT AGCTAACGTG 
GCCGCCAGAO GAGCTGCTGA GCCX3CCGCGC 
TGATGCCGCA GTGGTCTTAC ATGCACATCT 
CCGGGCTGGT GGAGTTTGCC CGCGCCACCG 
GAAACCCCAC GGTGGCGCCT ACGCACGACG 
TGCGGTTCAT CCCTGTGGAC CGT6AGGATA 
TAGCTGTGGG TGATAACCGT GTGCTGGACA 
TGCTGGACAO GGGCCCTACT TTTAAGCCCT 
CCAAGGGTGC CCCAAATCCT TGCGAATGGG 
TAGAAGAAGA GGACX3ATGAC AACGAAGACG 
CTCACGTATT TGGGCAGGCG CCTTATTCTG 
TAGGTGTCGA AGGTCAAACA CCTAAATATG 
TAGGAGAATC TCAGTGGTAC GAAACTGAAA 
AQACTACCCC AATGAAACCA TGTTACGQTT 
GGCAAGGCAT TCTTGTAAAG CAACAAAATG 
TTTTCTCAAC TACTGAGGCG ACCGCAGGCA 
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19S61 ATGGTGATAA CTTGACTCCT AAAGTGGTAT TGTACAGTGA AGATGTAGAT ATAGAAACCC 
19621 CAGACACTCA TATTTCTTAC ATGCCCACTA TTAAGGAAGG TAACTCACGA GAACTAATGG 
19681 GCCAACAATC TATGCCCAAC AGGCCTAATT ACATTGCTTT TAGGGACAAT TrTATTGCTC 
19741 TAATGTATTA CAACAGCACG GGTAATATGG GTGTTCTGGC GGGCCAAOCA TCGCAGTTOA 
19801 ATGCTGTTGT AGATTTCCAA GACAGAAACA CAGAGCTTTC ATACCAGCTT TTGCTTQATT 
19861 CCATTOGTGA TAGAACCAGG TACTTTTCTA TGTGGAATCA GGCTGTTQAC AGCTATGATC 
19921 CAGATGTTAG AATTATTOAA AATC»TGGAA CTGAAGATGA ACTTCCAAAT TACTGCTTTC 
19981 CACTGOOAGG TOTGATTAAT ACAGAGACTC TTACCAAGGT AAAACCTAAA ACAGGTCAGG 
20041 AAAATGGATC GGAAAAAGAT GCTACAGAAT TTTCAOATAA AAATGAAATA AGAGTTGGAA 
20101 ATAATTTTGC CATGGAAATC AATCTAAATG CCAACCTGTG GAGAAATTTC CTGTACTCCA 
20161 ACATAGCX3CT GTATTXGCCC GACAAGCTAA AGTACAGTCC TTCCAACGTA AAAATTTCTG 
20221 ATAACCCAAA CACCTACX3AC TACATGAACA AGCGAGTGGT GGCTCCCGGG TTAGTGGACT 
20281 GCTACATTAA CCTTGGAGCA CGCTGGTCCC TTGACTATAT GGACAACGTC AACCCATTTA 
20341 ACCACCACCG CAATGCTGGC CTGCGCTACC GCTCAATGTT GCTOGGCAAT GGTCGCTATG 
20401 TCCCCTTCCA CATCCAGGTG CCTCAGAAGT TCTTTGCCAT TAAAAACCTC CTTCTCCTGC 
204G1 CGGGCTCATA CACCTACGAG TGGAACTTCA GGAAGGATGT TAACATGGTT CTGCAGAGCT 
20S21 CCCTAGGAAA TGACCTAAGG GTTQACGGAG CCAGCATTAA GTTTGATAGC ATTTGCCTTT 
205B1 ACGCCACCTT CTTCCCCATG GCCCACAACA CCGCCTCCAC GCTTGAGGCC ATGCTTW3AA 
20641 ACGACACCAA CGACCAGTCC TTTAACGACT ATCTCTCCGC CGCCAACATG CTCTACCCTA 
20701 TACCCGCCAA CGCTACCAAC GTCCCCATAT CCATCCCCTC CCGCAACTGG GCGGCTTTCC 
20761 GCGGCTOGOC CTTCACGCGC CTTAAGACTA AGGAAACCCC ATCACTGGGC TCGGGCTACG 
20821 ACCCTTATTA CACCTACTCT GGCTCTATAC CCTACCTAGA TGGAACCTTT TACCTCAACC 
20881 ACACCTTTAA GAAGGTBGCC ATTACCTTTG ACTCTTCTCT CAGCTGGCCT GGCAATGACC 
20941 GCCTGCTTAC CCCCAACGAG TTTGAAATTA AGCGCTCAGT TGACGGQGAG GGTTACAACG 
21001 TTOCCCAGTG TAACATGACC AAAGACTGGT TCCTGGTACA AATOCTAGCT AACTACAACA 
21061 TTGGCTACCA GGGCTTCTAT ATCCCAGAGA GCTACAAGGA CCGCATGTAC TCCTTCTTTA 
21121 GAAACTTCCA GCCCATGAGC CGTCAGGTGG TGGATGATAC TAAATACAAG GACTACCAAC 
21181 AGGTCGGCAT CCTACACCAA CACAACAACT CTGGATTTGT TGGCTACCTT GCCCCCACCA 
21241 TGCGCGAAGG ACAGGCCTAC CCTGCTAACT TCCCCTATCC GCTTATAGGC AAGACCGCAG 
21301 TTGACAGCAT TACCCAQAAA AAGTTTCTTT GCGATCGCAC CCTTTGGCGC ATCCCATTCT 
21361 CCAGTAACTT TATOTCCATG GQCGCACTCA CAQACCTGGG CCAAAACCTT CTCTACGCCA 
21421 ACTCCGCCCA CGCGCTAGAC ATGACTTTTG AGGTGGATCC CATOQAOGAG CCCACCCTTC 
21481 TTTATGTTTT GTTTGAAGTC TTTGACGTGG TCCGTGTGCA COGGCCGCAC CGCX5GCQTCA 
21541 TCGAAACCGT GTACCTGCGC ACGCCCTTCT CGGCCGGCAA CGCCACAACA TAAAGAAGCA 
21601 AGCAACATCA ACAACAGCTG CCGCCATGGG CTCCAGTGAG CAGGAACTGA AAGCCArTOT 
21661 CAAAGATCTT GGTTGTGGGC CATATTTTTT GGGCACCTAT GACAAGCGCT TTCCAGGCTT 
21721 TGTTTCTCCA CACAAGCTCG CCTGCGCCAT AGTCAATACG GCCGGTCGCG AGACTGGGGG 
21781 CGTACACIGG ATGGCCTTTC CCTCGAACCC GCACTCAAAA ACATGCTACC TCTTTGAGCC 
21841 CTTTGGCTTT TCTGACCAGC GACTCAAGCA GGTTTACCAG TTTGAOTACG AGTCACTCCT 
21901 GCGCCGTAGC GCCATTGCTT CTTCCCCCGA CCGCTGTATA ACXSCTGGAAA AOTCCACCCA 
21961 AAGCGTACAG GGGCCCAACT CGGCCGCCTG TGGACTATTC TGCTGCATGT TTCTCCAOGC 
22021 CTTTGCCAAC TGGCCCCAAA CTCCCATGGA TCACAACCCC ACCATGAACC TTATTACOGG 
22081 06TACCCAAC TCCATGCTCA ACAGTCCCCA GGTACAGCCC ACCCTGCGTC GCAACCAGGA 
22141 ACAGCTCTAC AGCTTCCTQO AGCGCCACTC GCCCTACTTC CGCAGCCACA GTGCGCAGAT 
22201 TAGGAGOGCC ACTTCTTTTT GTCACTTGAA AAACATOTAA AAATAATCTA CTAGAGA^ 
22261 TTTCAATAAA GGCAAATGCT TTTATTTGTA CACTCT060G TGATTATTTA CCCCCACCCT 
22321 TCCCGTCTGC GCCGTTTAAA AATCAAAGGG GTTCTGCCOC GCATCGCTAT GCGCCTCTOG 
22381 CAGGGACACG TrGCX».TACT GGIGXTTAGT GCTCCACTTA AACTCAGGCA CAACCATCCG 
22441 CGGCAGCTCG GTGAAGTTTT CACTCCACAG GCTGCGCACC ATCACCAACG COTTTAOCaG 
22S01 GTCGGGCGCC GATATCTTGA AGTCGCAGTT GGGGCCTCCG CCCTGCGCGC GCQAGTTGCG 
22561 ATACACAGGG TTGCAGCACT GGAACACTAT CAGCGCCGGQ TGGTGCACGC TGGCCAQCAC 
22621 GCTCTTGTCG GAGATCAGAT CCGOOTCCAO OTCCTCCGCG TTGCTCAGGG CGAACGGAGT 
22681 CAACTTTOGT AGCTOCCTTC CCAAAAAGGG OGCGTOCCCA GGCTTTGAGT TGCACTOGCA 
22741 CCGTAGTGGC ATOUUUVGGT GAOCOTOCCC GGTCTQOOCO TTAOQA.TACA GCGCCTGCAT 
22801 AAAAGCCTTG ATCTSCTTAA AAGCCACCTG AGCCTTT6CG CCTTCAGAOA AOAACATOTO 
22861 GCAAGACTTG CCGGAAAACT GATTGGCCX3G ACAGGCCQOG TCQTGCRCOC AOCACCTTOC 
22921 OTCOGTOTTO QAOATCTGCA CCACATTTCO OCCCCACOOG TTCTTCACOA TCTTeOCCTT 
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GCTAGACTGC 
GTGCTCCTTA 
GCAGCGGTGC 
AAACGACTGC 
GGTGAAGGTC 
CAQAGCTTCC 
GTACTTGTCC 
CACACTCAGC 
CTCTTGCGTC 
CTTACCTCCT 
CGCCACATCT 
GGGCTTGGGA 
GGTCGATGGC 
GTCCTCGGAC 
CGACGGGGAC 
GCGCTCGGGG 
GCAGAAAAAG 
CGCCACCACC 
CCCGCTTGAG 
CGAGGACCGC 
CGAGGAACAA 
CGTGCTGTTG 
CAGCGATGTG 
ACCGCGCGTA 
CTTCTACCCC 
CTGCAAGATA 
GOGGCAGGGC 
GGGTCTTGGA 
TGAAAGTCAC 
AAAACGCAGC 
CATGAGCACA 
AAATTTGCAA 
CTGGCTTCAA 
AGTGCTCGTT 
GCGCAAGCTA 
CAAGATCTCC 
CCGCCTTGGG 
CCGCGACTGC 
GCAGTGCTTG 
GGACCTATGG 
CCCCGAACGC 
GTTGCAGAAC 
TOCACTTCCT 
CCACTGCTAC 
C6TGAGCGGT 
CTCCCTGGTT 
GCAGGGTCCC 
GTGGACGTCG 
GTTCTACGAA 
GGGCCACATT 
AAAGGGACGG 
GCCGCCGCAG 
AGAAGCTGCA 
AGGAGGTTTT 
AAGCTTCCGA 
CX5C0GGCGCC 
CGCCGCCGGC 



TCCTTCAGCG 
TTTATCATAA 
AGCCACAACG 
AGGTACGCCT 
AGCTGCAACC 
ACTTGGTCAG 
ATCA6CX5CGC 
GGGTTCATCA 
CGCATACCAC 
TTGCCATGCT 
TCTCTTTCTT 
GAAGGGCGCT 
CGOGGGCTGG 
TCGATAOGCC 
GGG6ACGACA 
GTGGTTTCGC 
ATCATGGAGT 
GCCTCCACCG 
GAG6AGGAAG 
TCAOTACXAA 
GTCX3GG0GGG 
AAGCATCT6C 
CCCCTCGCCA 
CCCCCCAAAC 
GTATTTGCCG 
CCCCTATCCT 
GCTGTCATAC 
CGCGACGAGA 
TCTGGAGTGT 
ATCGAGGTCA 
GTCATGAGTG 
GAACAAACAG 
ACGCGCGAGC 
ACCX5TGGAGC 
GAGGAAACAT 
AACGTGGAGC 
CAAAACX3TGC 
GTTTACTTAT 
GAGGAGTGCA 
ACGGCCTTCA 
CTGCTTAAAA 
TTTAGGAACT 
AGCGACTTTG 
CTTCTGCAGC 
GACGGTCTAC 
TGCAATTCX3C 
TCGCCTGACX5 
GCTTACCTTC 
GACCAATCCC 
CTTGGCCAAT 
GGGGTTTACT 
CCCTATCAGC 
GCTGCCGCCG 
GGACGAGGAG 
GGTCGAAGAG 
CCAQAAATCG 
ACTGCCCGTT 



CGCGCTGCCC 
TGCTTCCGTG 
CGCAGCCCGT 
GCAGGAATCG 
CGCGGTGCTC 
GCAGTAGTTT 
GCGCAGCCTC 
CCGTAATTTC 
GCGCCACTGG 
TGATTAGCAC 
CCTCGCTGTC 

GTGTGCGCGG 
GCCTCATCCG 
CGTCCTCCAT 
GCTGCTCCTC 
CAGTCGAGAA 
ATGCCGCCAA 
TGATTATCXyi 
CAGAGGATAA 

gggacgaaag 
agcgcx::agt6 
tagcggatgt 
gccaagaaaa 
tgccagaggt 
gccgtgccaa 

CTGATATCGC 
AGCGC6CGGC 
TGGTGGAACT 
CCCACTTTGC 
AGCTGATCGT 
AGGAGGGCCT 
CTGCCX5ACTT 
TTGAGTGCAT 
TGCACTACAC 
TCTQCAACCT 
TTCATTCCAC 
TTCTATGCTA 
ACCTCAAGGA 
ACGAGCGCTC 
CCCTGCAACA 
TTATCCTAGA 
TGCCCATTAA 
TAGCCAACTA 
TGGAGTGTCA 
AGCTGCTTAA 
AAAAGTCCGC 
GCAAATTTGT 
GCCXX3CCAAA 
TGCAAGCCAT 
TGGACCCCCA 
AGCAGCCGCX3 
CCACCGACGG 
GAGGAGGACA 
GTOTCAGACG 
GCAACCGGTT 
CGCCGACCCA 



GTTTTCGCTC 
TAGACACTTA 
GGGCTCGTGA 
CCCCATCATC 
CTCGTTCAGC 
GAAGTTCGCC 
CATGCCCTTC 
ACTTTCCGCT 
GTCGTCTTCA 
CGGTGGGTTG 
CACGATTACC 
CTTGGGCGCA 
CACCAGCGCX5 
CTTTTTTOGG 
GGTTGGGGGA 
TTCCCGACTG 
GAAGGACAGC 
CGCGCCTACC 
GCA6GACCCA 
AAA6CAAGAC 
GCATGGOGAC 
CGCCATTATC 
CAGCCTTGCC 
CGGCACATGC 
GCTTGCCACC 
CCGCAGCCGA 
CTCGCTCAAC 
AAACGCTCTG 
CGAGGGTGAC 
CTACCCGGCA 
GCGCCGTGCG 
ACCCGCAGTT 
GGAGGAGCGA 
GCAGCGGTTC 
CTTTC6ACAG 
GQTCTCCTAC 
GCTCAAGGGC 
CACCTGGCAG 
GCTGCAGAAA 
CGTGGCCGCG 
GGGTCTGCCA 
GCGCTCAGGA 
GTACCGCGAA 
CCTTGCCTAC 
CTGTOGCTGC 
CGAAAGTCAA 
GGCTCCGG6G 
ACCTGAOGAC 
TGCGGAGCTT 
CAACAAAGCC 
6TCCGGCGAG 
GGCCCTT6CT 
AOSAOGAOQA 
T6ATGGAAGA 
AAACACCGTC 
CCAGCATGGC 
ACCGTAGATO 



GTCACATCCA 
AGCTCGCCTT 
TOCTTGTAGG 
GTCACAAAGG 
CAGGTCTTGC 
TTTAQATCGT 
TCCCACGCAG 
TCGCTGGGCT 
TTCAGCCX5CC 
CTGAAACCCA 
TCTGGTGATG 
ATGGCCAAAT 
TCTIGTGATG 
GGCGCCCGG6 
CGTCGCGCCG 
GCCATTTCCT 
CTAACCGCCC 
ACCTTCCCCG 
GGTTTT6TAA 
CAG6ACAACG 
TACCTAGATG 
T6CGACGCGT 
TACGAACGCC 
GAOCCCAACC 
TATCACATCT 
GCGGACAAGC 
GAAGTGCCAA 
CAACAGGAAA 
AACGCGOGCC 
CTTAACCTAC 
CAGCCCCTGG 
GGCGACGAGC 
CX3CAAACTAA 
TTTGCTGACC 
GGCTACGTAC 
CTTGGAATTT 
GAG6CGCX3CC 
ACGGCCATGG 
CTGCTAAAGC 
CACCTGGCGG 
GACTTCACCA 
ATCTTGCCCG 
TGCCCTCCXJC 
CACTCTGACA 
AACCTATGCA 
ATTATCGGTA 
TTGAAACTCA 
TACCACOCCC 
ACCGCCTGCG 
CGCCAAGAGT 
GAGCTCAACC 
TCCCAGGAT6 
ATACTGGGAC 
CTGGGAGAGC 
ACCCTCGGTC 
TACAACCTCC 
GGACACCACT 



TTTCAATCAC 
CGATCTCAGC 
TCACCTCTGC 
TCTTGTTGCT 
ATACGGCCGC 
TATCCACGTG 
ACACGATCGG 
CTTCCTCTTC 
GCACTGTGCG 
CCATTTGTAG 
GCGGGCGCTC 
CCGCCGCCGA 
AGTCTTCCTC 
GAGGCGGCGG 
CACCX3CGTCC 
TCTCCTATAG 
CCTCTGAGTT 
TCQAGGCACC 
GCGAAGACGA 
CAGAGGCAAA 
TGGGAGACGA 
TGCAAGAGCG 
ACCTATTCTC 
CGCGCCTCAA 
TTTTCCAAAA 
AGCTGGCCTT 
AAATCTTTGA 
ACA6CGAAAA 
TAGCOGTACT 
CCCCCAAGGT 
AGAGGGATGC 
AGCTAGCGCG 
TGATGGCCGC 
CGGAGATGCA 
GCCAGGCCTG 
TGCAGGAAAA 
GC6ACTACGT 
GCGTTTGGCA 
AAAACTTGAA 
ACATCATTTT 
GTCAAAGCAT 
CCACCTGCTG 
CGCTTTG6GG 
TAATG6AAGA 
CCCCGCACCG 
CCTTTGAGCT 
CrCOGGGGCT 
ACQAGATTAG 
TCATTACCCA 
TTCTGCTACG 
CAATCCCCCC 
GCACCCAAAA 
AGTCAGGCAG 
CTAGACGAGG 
GCATTCCCCT 
GCTCCTCAGG 
GOAACCAGOG 
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26401 CCGGTAAGTC OVAGCAGCCG CCGCCGTTAG CCCAAGAGCA ACAAOIGCXSC CAAGGCTACC 
26461 GCTCATGGCG CGGGCACAAG AACGCCATAG riX j Cr m Cl T GCAAGACTGT GGGGGCAACA 
26521 TCTCCTTCGC CCGCCGCTTT CTTCTCTACC ATCACGGCGT GGCCTTCCCC CGTAACATCC 
265B1 TGCATTACTA CCGTCATCTC TACAGCCCAT ACTGCACCGG CGGCAGCGGC AGCGGCAGCA 
26641 ACAGCAGCGG CCACACAGAA GCAAAGGCGA CCGGATAGCA AGACTCTGAC AAAGCCCAAG 
26701 AAATCCACAG CGGCGGCAGC AGCAGGAGGA GGAGCGCTGC GTCTGGCGCC CAACGAACCC 
26761 GTATCGACCC GCXvAGCTTAG AAACAGGATT TTTCCCACTC TGTATGCTAT ATTTCAACAG 
26821 AGCAGGG6CC AAGAACAAGA GCTGAAAATA AAAAACAGGT CTCTGCQATC CCTCACCCGC 
26881 AGCTGCCTGT ATCACAAAAG CGAAGATCAG CTTCGGCGCA CX3CTGGAAGA CGCGGAGGCT 
26941 CTCTTCAGTA AATACTGCGC GCTQACTCTT AAGGACTAGT TTCGCGCCCT TTCTCAAATT 
27001 TAAGCGCGAA AACTACGTCA TCTCCAGCGG CCACACCCGG CGCCAGCACC TGTCGTCAGC 
27061 GCCATTATGA GCAAGGAAAT TCCCACGCCC TACATGTGGA GTTACCAGCC ACAAATGGGA 
27121 CTTGCGGCTG GAGCTGCCCA AGACTACTCA ACCCGAATAA ACTACATGAG CGCGGGACCC 
27181 CACATGATAT CCCGGGTCAA CGGAATCCGC GCCCACCGAA ACCGAATTCT CTTGGAACAG 
27241 OCGGCTATTA CCACCACACC TCGTAATAAC CTTAATCCCC GTAGTTGGCC CGCTGCCCTG 
27301 GTGTACCAGG AAAGTCCCGC TCCCACCACT GTGGTACTTC CCAGAGACGC CCA6GCCGAA 
27361 GTrCAGATGA CTAACTCAGG GGCGCAGCTT GCGGGCGGCT TTCGTCACAG OQTGCGGTCG 
27421 CCCGGGCAGG GTATAACTCA CCTGACAATC AGAGGGCGAG GTATTCAGCT CAACGACGAG 
27481 TCX3GTGAGCT CCTCGCTTGG TCTCCGTCCG GACGGGACAT TTCAGATCX5G CGGCGCCGGC 
27541 CGTCCTTCAT TCACGCCTCG TCAGGCAATC CTAACTCTGC AGACCTCGTC CTCTGAGCCG 
27601 CGCTCTGGAG GCATTGGAAC TCTGCAATTT ATTGAGGAGT TTGTGCCATC GGTCTACTTT 
27661 AACCCCTTCT CGGGACCTCC CGGCCACTAT CCGGATCAAT TTATTCCTAA CTTTGACGCG 
27721 GTAAAGGACT CGGCGGACGG CTACGACT6A ATGTTAAGTG GAGAGGCAGA GCAACTGCGC 
27781 CTGAAACACC TGGTCCACTG TOGCOGCXAC AAGTGCTTTG CCCGCGACTC CGGTGAGTTT 
27841 TGCTACTTTG AATTGCCCGA GGATCATATC GAGGGCCCGG CGCAOGGCGT CCGGCTTACC 
27901 GCCCAGGGAG AGCTTOCCCG TAGCCTGATT CGGGAGTTTA CCCAGCGCCC CCTGCTAGTT 
27961 GAGCX3GGACA GGGGACCCTG TGTTCTCACT GTGATTTGCA ACTGTCCTAA CCTTGGATTA 
28021 CATCAAGATC TTTGTTGCCA TCTCTGTGCT GAGTATAATA AATACAGAAA TTAAAATATA 
26081 CTGGGGCTCC TATCGCCATC CTGTAAACX3C CACCGTCTTC ACCCXJCCCAA GCAAACCAAG 
28X41 GCXSAACCTTA CCTGGTACTT TTAACATCTC TCCCTCTGTG ATTTACAACA GTTTCAACCC 
28201 AGACGGAGTG AGTCTACGAG AGAACCTCTC CGAGCTCAGC TACTCCATCA GAAAAAACAC 
28261 CACCCTCCTT ACCTGCCGGG AACGTACX3AG TGCGTCACCG GCCGCTGCAC CACACCTACC 
28321 GCCTGACCGT AAACCAGACT TTTTCCGGAC AGACCTCAAT AACTCTGTTT ACCAGAACAG 
28381 GAGGTGAGCT TAGAAAACCC TTAGGGTATT AGGCCAAAGG CGCAGCTACT GTGGGGTTTA 
28441 TGAACAATTC AAGCAACTCT ACX3GGCTATT CTAATTCAGG TTTCTCTAGA AGTCAGGCTT 
28501 CCTGGATGTC AGCATCTGAC TTTGGCCAGC ACCTGTCCCG CGGATTTGTT CCAGTCCAAC 
28561 TACAGCGACC CACCCTAACA GAGATGACCA ACACAACCAA CGCGGCCGCC GCTACCGGAC 
28621 TTACATCTAC CACAAATACA CCCCAAGTTT CTGCCTTTGT CAATAACTGG GATAACTTGG 
28681 GCATGTGGTG GTTCTCCATA GCGCTTATGT TTGTATGCCT TATTATTATG TGGCTCATCT 
28741 GCTGCCTAAA 6CGCAAACGC GCXXX3ACCAC CCATCTATAG TCCCATCATT GTGCTACACC 
28801 CAAACAATGA TGGAATGCAT AGATTGGACG GACTGAAACA CAT6TTCTTT TCTCTTACAG 
28861 TATGATTAAA TGAGATCTAG AAATG6ACGG AATTATTACA GAGGAGCGCC TGCTAGAAA6 
28921 ACGCAGGGCA GCGGCCGAGC AACAGCGCAT GAATCAAGAG CTCCAAGACA TGGTTAACTT 
28981 GCACCAGTGC AAAAGGGGTA TCTTTTGTCT GGTAAAGCAG GCCAAAGTCA CCTACGACAG 
29041 TAATACCACC GQACACCGCX: TTAGCTACAA GTTGCCAACC AAGCX3TCAGA AATTGGTGGT 
29101 CATGGTGGGA GAAAAGCCCA TTACCATAAC TCAGCACTCG GTAGAAACCG AAGGCTGCAT 
29161 TCACTCACCT TOTCAAGQAC CTGAGGATCT CTGCACCCTT ATTAAGACCC TGTGCGGTCT 
29221 CAAAGATCTT ATTCCCTTTA ACTAATAAAA AAAAATAATA AAGCATCACT TACTTAAAAT 
29281 CAGTTAGCAA ATTTCTGTCC AGTTTATTCA OCAOCACCTC CTTQCOCTCC TCCCAGCTCT 
29341 GGTATTGCAG CTTCCTCCTG GCTGCAAACT TTCTCCACAA TCTAAATGOA ATGTCAGTTT 
29401 CCTCCTGTTC CTGTCCATCC GCACCCACTA TCTTCATGTT GTTGCAGATG AAGCGCGCAA 
29461 GACCX3TCTGA AGATACCTTC AACCCCGTGT ATCCATATGA CACGGAAACC GGTCCTCCAA 
29521 CTGTGCCTTT TCTTACTCCT CCCTTTGTAT CCCCCAATGG GTTTCAAGAG AGTCCCCCTG 
29581 GGGTACTCTC TTTGCGCCTA TCCGAACCTC TAGTTACCTC CAATGGCATG CTTGCGCTCA 
29641 AAATGGGCAA CGGCCTCTCT CTGGACGAGG CCGGCAACCT TACCTCCCAA AATGTAACCA 
29701 CTGTGAGCCC ACCTCTCAAA AAAACXJVAGT CAAACATAAA CCTGGAAATA TCTGCACCCC 
29761 TCACAGTTAC CTCAGAAOCC CTAACTGTGG CTGCCGCCGC ACCTCTAATG GTCX5CGGGCA 
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29821 ACACACTCAC CATGCAATCA CAGGCCCCX3C TAACCGTGCA CGACTCCAAA CTTAGCATTG 
29881 CCACCCAAGG ACCCCTCACA GTGTCAGAAG GAAAGCTAGC CCTGCAAACA TCAGGCCCCC 
29941 TCACCACCAC CGATAGCAGT ACCCTTACTA TCACTGCCTC ACCCCCTCTA ACTACTGCCA 
30001 CTGGTAGCTT GGGCATTGAC TTGAAAGAGC CCATTTATAC ACAAAATGGA AAACTAGGAC 
30061 TAAAGTACGG GGCTCCTTTG CATGTAACAG ACGACCTAAA CACTTTGACC GTAGCAACTG 
30121 GTCCAGGTGT GACTATTAAT AATACTTCCT TGCAAACTAA AGTTACTGGA GCCTTGGGTT 
30181 TTGATTCACA AGGCAATATG CAACTTAATG TAGCAGGAGG ACTAAGGATT GATTCTCAAA 
30241 ACAGACGCCT TATACTTGAT GTTAGTTATC CGTTTGATGC TCAAAACCAA CTAAATCTAA 
30301 GACTAGGACA GGGCCCTCTT TTTATAAACT CAGCCCACAA CTTGGATATT AACTACAACA 
30361 AAGGCCTTTA CTTGTTTACA GCTTCAAACA ATTCCAAAAA GCTPGAGGTT AACCTAAGCA 
30421 CTGCCAAGGG GTTGATGTTT GACGCTACAG CCATAGCCAT TAATGCAGGA GATGGGCTTG 
30481 AATTTGGTTC ACCTAATGCA CCAAACACAA ATCCCCTCAA AACAAAAATT GG CCATG GCC 
30541 TAGAATTTGA TTCAAACAAG GCTATGGTTC CTAAACTAGG AACTGGCCTT AGTrTTGACA 
30601 GCACAGGTGC CATTACAGTA GGAAACAAAA ATAATGATAA GCTAACTTTG TGGACCACAC 
30661 CAGCTCCATC TCCTAACTGT AGACTAAATG CAGAGAAAGA TGCTAAACTC ACTTTGGTCT 
30721 TAACAAAATC TGGCAGTCAA ATACTTGCTA CAGTTTCAGT TTTGGCTGTT AAAGGCAGTT 
30781 TCGCTCCAAT ATCTGGAACA GTTCAAAGTG CTCATCTTAT TAT AAGAT TT GACGAAAATG 
30841 GAGTGCTACT AAACAATTCC TTCCTGGACC CAGAATATT6 GAACTTTAGA AATGGAGATC 
30901 TTACTGAAGG CACAGCCTAT ACAAACXSCTG TTGGATTTAT GCCTAACCTA TCAGCTTATC 
30961 CAAAATCTCA CGGTAAAACT GCCAAAAGTA ACATTGTCAG TCAAGTTTAC TTAAACGGAG 
31021 ACAAAACTAA ACCTGTAACA CTAACCATTA CACTAAACGG TACACAGGAA ACAGGAGACA 
31081 CAACTCCAAG TGCATACTCT ATGTCATTTT CATGGGACTG GTCTGGCCAC AACTACATTA 
31141 ATCAAATATT TGCCACATCC TCTTACACTT TTTCATACAT TGCCCAAGAA TAA AGAATC G 
31201 TTTGTCTTAT GTTTCAACGT GTTTATTTTT CAATTGCAGA AAATTTCAAG TCATTTTTCA 
31261 TTCAGTAGTA TAGCCCCACC ACCACATAGC TTATACAGAT CACCGTACCT TAATCAAACT 
31321 CACAGAACCC TAGTATTCAA CCTGCCACCT CCCTCCCAAC ACACAGAGTA CACAGTCCTT 
31381 TCTCCCCGGC TGGCCTTAAA AAGCATCATA TCATGGGTAA CAGACATATT CTTAGGTGTT 
31441 ATATTCCACA CGGTTTCCTG TCGAGCCAAA CGCTCATCAG TGATATTAAT AAACTCCCCG 
31501 GGCAGCTCAC TTAAGTTCAT GTCGCTGTCC AGCTGCTGAG CCACAGGCTG CTGTCCAACT 
31561 TCCGGTTGCT TAACGGGCGG CGAAGGAGAA GTCCACGCCT ACATGGGGGT AGAGTCATAA 
31621 TCGTGCATCA GGATAGGOCG GTGGTGCT6C AGCAGCGCGC GAATAAACTG CTGCCGCCGC 
31681 CGCTCCGTCC TGCAGGAATA CAACATGGCA GTGGTCTCCT CAGCGATGAT TCGCACCGCC 
31741 CGCAGCATAA GGCGCCTTGT CCTCCGGGCA CAGCAGCGCA CCCTGATCTC ACTTAAATCA 
31801 GCACAGTAAC TGCAGCACAG CACCACAATA TTGTTCAAAA TCCCACAGTG CAAGGCGCTG 
31861 TATCCAAAGC TCATGGCGGG GACCACAGAA CCCACGTGGC CATCATACCA CAAGCGCAGG 
31921 TAGATTAAGT GGCGACCCCT CATAAACACG CTGGACATAA ACATTACCTC TTTTGGCATG 
31981 TTOTAATTCA CCACCTCCCG GTACCATATA AACCTCTGAT TAAACATGGC GCCATCCACC 
32041 ACCATCCTAA ACCAGCTGGC CAAAACCTGC CCGCCGGCTA TACACTGCAG GGAACCGGGA 
32101 CTGGAACAAT GACAGTOGAG AGCCCAGGAC TCGTAACCAT GGATCATCAT GCTCGTCATG 
32161 ATATCAATGT TGGCACAACA CAGGCACACG TGCATACACT TCCTCAGGAT TACAAGCTCC 
32221 TCCCGCGTTA GAACCATATC CCAGGGAACA ACCCATTCCT GAATCAGCGT AAATCCCACA 
32281 CTGCAGGGAA GACCTCGCAC GTAACTCACG TTGTGCATTG TCAAAGTGTT ACATTCGGGC 
32341 AGCAGCGGAT GATCCTCCAG TATGGTAGCG CX3GGTTTCTG TCTCAAAAGG AGGTAGACGA 
32401 TCCCTACTOT ACXSOAGTGCXS CCGAGACAAC CGAGATCGTG TTGGTCGTAG TGTCATGCCA 
32461 aatggaacxk: CGGACGTAGT CATATTTCCT GAAGCAAAAC CAGGTGCGGG CGTGACAAAC 
32521 AGATCTOCGT CTCCGGTCTC GCCX3CTTAGA TCGCTCTGTG TAQTAGTTGT AGTATATCCA 
32581 CTCTCTCAAA GCATCCAGGC GCCCCXnXSGC TTCGGGTTCT ATGTAAACTC CTTCATaCGC 
32641 CX5CTGCCCTG ATAACATCCA CCACCGCAGA ATAAGCXACA CCCAGCCAAC CTACACATTC 
32701 GTTCTGCGAG TCACACACGG GAGQA6CGGG AAGAGCTGGA AGAACCATGT TTTTTTTTTT 
32761 ATTCCAAAAG ATTATCCAAA ACCTCAAAAT GAAGATCTAT TAAGTGAACX3 OKTIC C X rCr rC 
32821 CGGTGGCGTG GTCAAACTCT ACAGCCAAAG AACAGATAAT GGCATTTGTA AGATGTTGCA 
32861 CAATGGCTTC CAAAAGGCAA ACGGCCCTCA CGTCCAAGTG GACGTAAAGG CTAAACCCTT 
32941 CAGGGTGAAT CTCCTCTATA AACATTCCAG CACCTTCAAC CATGCCCAAA TAATTCTCAT 
33001 CTCGCCACCT TCTCAATATA TCTCTAAGCA AATCCCGAAT ATTAAGTCCG GCCATTGTAA 
33061 AAATCTGCTC CAGAGCGCCC TCCACCTTCA GCCTCAAGCA GCGAATCATG ATTGCAAAAA 
33121 TTCAGGTTCC TCACAGACCT GTATAAGATT CAAAAGCGQA ACATTAACAA AAATACOGCG 
33181 ATCCCX5TAGG TCCCTTCX3CA GGGCCAGCTG AACATAATCO TGCAGGTCTG CACGGACCAG 
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33241 CGCGGCCACT TCCCCGCCAG GAACCTTGAC AAAAGAACCC ACACTGATTA TGACACGCAT 
33301 ACTCGGAGCT ATGCTAACCA GCGTAGCCCC GATGTAAGCT TTGTTGCATG GGCGGCGATA 
33361 TAAAATGCAA GGTGCTGCTC AAAAAATCAG GCAAAGCCTC GCGCAAAAAA GAAAGCACAT 
33421 CGTAGTCATG CTCATGCAGA TAAAGGCAGG TAAGCTCCGG AACCACCACA GAAAAAQACA 
33481 CCATTITTCT CTCAAACATG TCTGCGGGTT TCTGCATAAA CACAAAATAA AATAACAAAA 
33541 AAACATTTAA ACATTAGAAG CCTGTCTTAC AACAGGAAAA ACAACCCTTA TAAOCATAAG 
33601 ACGGACTACX3 GCXATGCCX3G CGTGACCGTA AAAAAACTGG TCACCGTGAT TAAAAAGCAC 
33661 CACCGACAGC TCCTCGGTCA TGTCCGGAGT CATAATGTAA GACTCGGTAA ACACATCAOG 
33721 TTGATTCATC GGTCAGTGCT AAAAAGCGAC CGAAATAGCC CGGGGGAATA CATACCCGCA 
33781 GGCGTAGAGA CAACATTACA GCCCCCATAG GAGGTATAAC AAAATTAATA GGAGAGAAAA 
33841 ACACATAAAC ACCTGAAAAA CCCTCCTGCC TAGGCAAAAT AGCACCCTCC CGCTCCAGAA 
33901 CAACATACAG CGCTTCACAG CGGCAGCCTA ACAGTCAGCC TTACCAGTAA AAAAGAAAAC 
33961 CTATTAAAAA AACACCACTC GACACGGCAC CAGCTCAATC AGTCACAGTG TAAAAAAGGG 
34021 CCAAGTGCAG AGCGAGTATA TATAGGACTA AAAAATGACG TAACGGTTAA AGTCCACAAA 
34081 AAACACCCAG AAAACCGCAC GC6AACCTAC GCCCAGAAAC GAAA6CCAAA AAACCCACAA 
34141 CTTCCTCAAA TCGTCACTTC CXJTTTTCCCA CGTTACGTAA CTTCCCATTT TAAGAAAACT 
34201 ACAATTCCCA ACACATACAA GTTACTCCGC CCTAAAACCT ACX3TCACCCG CCCCGTTCCC 
34261 ACGCCCCGCG CCACGTCACA AACTCCACCC CCTCATTATC ATATTGGCTT CAATCCAAAA 
34321 TAAGGTATAT TATTGATGAT G 
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El A Functions Major Late Transcription Unit 

• Induce Ad genes. 
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SEQUENCE LISTING 



<110> Wold, William S.M. 
Toth, Karoly 
Doronin, Konstantin 
Tollefson, Ann E. 



<120> Replication-Competent Anti-Cancer Vectors 



<130> 16153-5152 



<140> 
<141> 



<150> 09/351,778 
<151> 1999-07-12 



<160> 20 



<170> Patentin Ver. 2.0 



<210> 1 
<211> 33592 
<212> DNA 

<213> Adenovirus subgroup C 



<400> 1 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60 
ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 
gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 24 0 
taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 
agtgaaatct gaataatttt gtgttactca tagcgcgtaa tatttgtcta gggccgcggg 360 
gactttgacc gtttacgtgg agactcgccc aggtgttttt ctcaggtgtt ttccgcgttc 420 
cgggtcaaag ttggcgtttt attattatag tcagctgacg tgtagtgtat ttatacccgg 480 
tgagttcctc aagaggccac tcttgagtgc cagcgagtag agttttctcc tccgagccgc 540 
tccgacaccg ggactgaaaa tgagacatga ggtactggct gataatcttc cacctcctag 600 
ccattttgaa ccacctaccc ttcacgaact gtatgattta gacgtgacgg cccccgaaga 660 
tcccaacgag gaggcggttt cgcagatttt tcccgactct gtaatgttgg cggtgcagga 720 
agggattgac ttactcactt ttccgccggc gcccggttct ccggagccgc ctcacctttc 780 
ccggcagccc gagcagccgg agcagagagc cttgggtccg gtttgccacg aggctggctt 84 0 
tccacccagt gacgacgagg atgaagaggg tgaggagttt gtgttagatt atgtggagca 900 
ccccgggcac ggttgcaggt cttgtcatta tcaccggagg aatacggggg acccagatat 960 
tatgtgttcg ctttgctata tgaggacctg tggcatgttt gtctacagta agtgaaaatt 1020 
atgggcagtg ggtgatagag tggtgggttt ggtgtggtaa tttttttttt aatttttaca 1080 
gttttgtggt ttaaagaatt ttgtattgtg atttttttaa aaggtcctgt gtctgaacct 1140 
gagcctgagc ccgagccaga accggagcct gcaagaccta cccgccgtcc taaaatggcg 1200 
cctgctatcc tgagacgccc gacatcacct gtgtctagag aatgcaatag tagtacggat 1260 
agctgtgact ccggtccttc taacacacct cctgagatac acccggtggt cccgctgtgc 1320 
cccattaaac cagttgccgt gagagttggt gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgctta acgagcctgg gcaacctttg gacttgagct gtaaacgccc caggccataa 1440 
ggtgtaaacc tgtgattgcg tgtgtggtta acgcctttgt ttgctgaatg agttgatgta 1500 
agtttaataa agggtgagat aatgtttaac ttgcatggcg tgttaaatgg ggcggggctt 1560 
aaagggtata taatgcgccg tgggctaatc ttggttacat ctgacctcat ggaggcttgg 1620 
gagtgtttgg aagatttttc tgctgtgcgt aacttgctgg aacagagctc taacagtacc 1680 
tcttggtttt ggaggtttct gtggggctca tcccaggcaa agttagtctg cagaattaag 1740 
gaggattaca agtgggaatt tgaagagctt ttgaaatcct gtggtgagct gtttgattct 1800 
ttgaatctgg gtcaccaggc gcttttccaa gagaaggtca tcaagacttt ggatttttcc 1860 
acaccggggc gcgctgcggc tgctgttgct tttttgagtt ttataaagga taaatggagc 1920 
gaagaaaccc atctgagcgg ggggtacctg ctggattttc tggccatgca tctgtggaga 1980 
gcggttgtga gacacaagaa tcgcctgcta ctgttgtctt ccgtccgccc ggcgataata 2040 
ccgacggagg agcagcagca gcagcaggag gaagccaggc ggcggcggca ggagcagagc 2100 
ccatggaacc cgagagccgg cctggaccct cgggaatgaa tgttgtacag gtggctgaac 2160 
tgtatccaga actgagacgc attttgacaa ttacagagga tgggcagggg ctaaaggggg 2220 
taaagaggga gcggggggct tgtgaggcta cagaggaggc taggaatcta gcttttagct 2280 
taatgaccag acaccgtcct gagtgtatta cttttcaaca gatcaaggat aattgcgcta 2340 
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atgagcttga tctgctggcg cagaagtatt 
agccagggga tgattttgag gaggctatta 
attgcaagta caagatcagc aaacttgtaa 
acggggccga ggtggagata gatacggagg 
atatgtggcc gggggtgctt ggcatggacg 
gccccaattt tagcggtacg gttttcctgg 
gcttctatgg gtttaacaat acctgtgtgg 
gtgcctttta ctgctgctgg aagggggtgg 
agaaatgcct ctttgaaagg tgtaccttgg 
gccacaatgt ggcctccgac tgtggttgct 
agcataacat ggtatgtggc aactgcgagg 
acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata ctgacccgct 
tgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 
ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
tctggatgtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 
tgtttacaaa gcggttaagc tgggatgggt 
actgtatttt taggttggct atgttcccag 
gaaccaccag cacagtgtat ccggtgcact 
atgcgtggaa gaacttggag acgcccttgt 
taatgatggc aatgggccca cgggcggcgg 
cgtcatagtt gtgttccagg atgagatcgt 
gggtgccaga ctgcggtata atggttccat 
tttgcatttc ccacgctttg agttcagatg 
agaaaacggt ttccggggta ggggagatca 
gcgacttacc gcagccggtg ggcccgtaaa 
taagagagct gcagctgccg tcatccctga 
tgactcgcat gttttccctg accaaatccg 
gttcttgcaa ggaagcaaag tttttcaacg 
tgagcgtttg accaagcagt tccaggcggt 
ctcgatccag catatctcct cgtttcgcgg 
tcggtgctcg tccagacggg ccagggtcat 
cgtagtctgg gtcacggtga aggggtgcgc 
gaggctggtc ctgctggtgc tgaagcgctg 
gcatttgacc atggtgtcat agtccagccc 
gcccttggag gaggcgccgc acgaggggca 
cgcgagaaat accgattccg gggagtaggc 
gcattccacg agccaggtga gctctggccg 
ctttttgatg cgtttcttac ctctggtttc 
aaggctgtcc gtgtccccgt atacagactt 
gtcctcctcg tatagaaact cggaccactc 
gaaggaggct aagtgggagg ggtagcggtc 
ggtgtgaaga cacatgtcgc cctcttcggc 
ggccacgtga ccgggtgttc ctgaaggggg 
ctcactctct tccgcatcgc tgtctgcgag 
aaaagcgggc atgacttctg cgctaagatt 
attcacctgg cccgcggtga tgcctttgag 
aatctttttg ttgtcaagct tggtggcaaa 
ggcgatggag cgcagggttt ggtttttgtc 
tagctgcacg tattcgcgcg caacgcaccg 



ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 2460 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
gggtggttat tatgaatgta aggtttactg 2640 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 27 60 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 2940 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgca tttgggtaac aggagggggg 3120 
acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 3240 
cctgcgagtg tggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 3540 
ccggggtgcg tcagaatgtg atgggctcca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
ttttggcaca attggattct ttgacccggg 3840 
atctgcgcca gcaggtttct gccctgaagg 3900 
taaataaaaa accagactct gtttggattt 3960 
taggggtttt gcgcgcgcgg taggcccggg 4020 
gtattttttc caggacgtgg taaaggtgac 4080 
cgtctctggg gtggaggtag caccactgca 4140 
tgatccagtc gtagcaggag cgctgggcgt 4200 
tgattgccag gggcaggccc ttggtgtaag 4260 
gcatacgtgg ggatatgaga tgcatcttgg 4320 
ccatatccct ccggggattc atgttgtgca 4380 
tgggaaattt gtcatgtagc ttagaaggaa 4 440 
gacctccaag attttccatg cattcgtcca 4500 
cctgggcgaa gatatttctg ggatcactaa 4560 
cataggccat ttttacaaag cgcgggcgga 4 620 
ccggcccagg ggcgtagtta ccctcacaga 4 680 
gggggatcat gtctacctgc ggggcgatga 4740 
gctgggaaga aagcaggttc ctgagcagct 4800 
tcacacctat taccgggtgc aactggtagt 4860 
gcaggggggc cacttcgtta agcatgtccc 4 920 
ccagaaggcg ctcgccgccc agcgatagca 4980 
gtttgagacc gtccgccgta ggcatgcttt 504 0 
cccacagctc ggccacctgc tctacggcat 5100 
gttggggcgg ctttcgctgt acggcagtag 5160 
gtctttccac gggcgcaggg tcctcgtcag 5220 
tccgggctgc gcgctggcca gggtgcgctt 5280 
ccggtcttcg ccctgcgcgt cggccaggta 534 0 
ctccgcggcg tggcccttgg cgcgcagctt 5400 
gtgcagactt ttgagggcgt agagcttggg 54 60 
atccgcgccg caggccccgc agacggtctc 5520 
ttcggggtca aaaaccaggt ttcccccatg 5580 
catgagccgg tgtccacgct cggtgacgaa 5640 
gagaggcctg tcctcgagcg gtgttccgcg 5700 
tgagacaaag gctcgcgtcc aggccagcac 5760 
gttgtccact agggggtcca ctcgctccag 5820 
atcaaggaag gtgattggtt tgtaggtgta 5880 
gctataaaag ggggtggggg cgcgttcgtc 5940 
ggccagctgt tggggtgagt actccctctg 6000 
gtcagtttcc aaaaacgagg aggatttgat 6060 
ggtggccgca tccatctggt cagaaaagac 6120 
cgacccgtag agggcgttgg acagcaactt 6180 
gcgatcggcg cgctccttgg ccgcgatgtt 6240 
ccattcggga aagacggtgg tgcgctcgtc 6300 
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gggcaccagg tgcacgcgcc aaccgcggtt gtgcagggtg acaaggtcaa cgctggtggc 6360 
tacctctccg cgtaggcgct cgttggtcca gcagaggcgg ccgcccttgc gcgagcagaa 6420 
cggcggtagg gggtctagct gcgtctcgtc cggggggtct gcgtccacgg taaagacccc 6480 
gggcagcagg cgcgcgtcga agtagtctat cttgcatcct tgcaagtcta gcgcctgctg 6540 
ccatgcgcgg gcggcaagcg cgcgctcgta tgggttgagt gggggacccc atggcatggg 6600 
gtgggtgagc gcggaggcgt acatgccgca aatgtcgtaa acgtagaggg gctctctgag 6660 
tattccaaga tatgtagggt agcatcttcc accgcggatg ctggcgcgca cgtaatcgta 6720 
tagttcgtgc gagggagcga ggaggtcggg accgaggttg ctacgggcgg gctgctctgc 6780 
tcggaagact atctgcctga agatggcatg tgagttggat gatatggttg gacgctggaa 6840 
gacgttgaag ctggcgtctg tgagacctac cgcgtcacgc acgaaggagg cgtaggagtc 6900 
gcgcagcttg ttgaccagct cggcggtgac ctgcacgtct agggcgcagt agtccagggt 6960 
ttccttgatg atgtcatact tatcctgtcc cttttttttc cacagctcgc ggttgaggac 7020 
aaactcttcg cggtctttcc agtactcttg gatcggaaac ccgtcggcct ccgaacggta 7080 
agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 7140 
tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 7260 
gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 7320 
gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 7380 
ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 7440 
gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc ttgatggaag gcaatttttt 7500 
aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 7560 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 7 620 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 7680 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtc'tcg 77 4 0 
cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 7800 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 7860 
ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 7 920 
gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 7980 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 8040 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 8100 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 8160 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 8220 
tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 8280 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 8340 
tagatccagg tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 8400 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 8460 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 8520 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 8580 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 8640 
cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 8700 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 8820 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 8880 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 8940 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc ^gggcgaagac ggcgtagttt 9000 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 9060 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 9120 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 9180 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 9240 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 9300 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 9360 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 94 20 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 9480 
gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 9540 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 9600 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 9660 
cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 9720 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 9780 
aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 9840 
tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 9900 
tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 9960 
gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 10020 
aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 10080 
tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 10140 
tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 10200 
acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 10260 
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tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 10320 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 10380 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 10440 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagcg tgcaaaagga 10500 
gagcctgtaa gcgggcactc ttccgtggtc tggtggataa attcgcaagg gtatcatggc 10560 
ggacgaccgg ggttcgagcc ccgtatccgg ccgtccgccg tgatccatgc ggttaccgcc 10620 
cgcgtgtcga acccaggtgt gcgacgtcag acaacggggg agtgctcctt ttggcttcct 10680 
tccaggcgcg gcggctgctg cgctagcttt tttggccact ggccgcgcgc agcgtaagcg 10740 
gttaggctgg aaagcgaaag cattaagtgg ctcgctccct gtagccggag ggttattttc 10800 
caagggttga gtcgcgggac ccccggttcg agtctcggac cggccggact gcggcgaacg 10860 
ggggtttgcc tccccgtcat gcaagacccc gcttgcaaat tcctccggaa acagggacga 10920 
gccccttttt tgcttttccc agatgcatcc ggtgctgcgg cagatgcgcc cccctcctca 10980 
gcagcggcaa gagcaagagc agcggcagac atgcagggca ccctcccctc ctcctaccgc 11040 
gtcaggaggg gcgacatccg cggttgacgc ggcagcagat ggtgattacg aacccccgcg 11100 
gcgccgggcc cggcactacc tggacttgga ggagggcgag ggcctggcgc ggctaggagc 11160 
gccctctcct gagcggtacc caagggtgca gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gccgcggcag aacctgtttc gcgaccgcga gggagaggag cccgaggaga tgcgggatcg 11280 
aaagttccac gcagggcgcg agctgcggca tggcctgaat cgcgagcggt tgctgcgcga 1134 0 
ggaggacttt gagcccgacg cgcgaaccgg gattagtccc gcgcgcgcac acgtggcggc 11400 
cgccgacctg gtaaccgcat acgagcagac ggtgaaccag gagattaact ttcaaaaaag 114 60 
ctttaacaac cacgtgcgta cgcttgtggc gcgcgaggag gtggctatag gactgatgca 11520 
tctgtgggac tttgtaagcg cgctggagca aaacccaaat agcaagccgc tcatggcgca 11580 
gctgttcctt atagtgcagc acagcaggga caacgaggca ttcagggatg cgctgctaaa 11640 
catagtagag cccgagggcc gctggctgct cgatttgata aacatcctgc agagcatagt 11700 
ggtgcaggag cgcagcttga gcctggctga caaggtggcc gccatcaact attccatgct 117 60 
tagcctgggc aagttttacg cccgcaagat ataccatacc ccttacgttc ccatagacaa 11820 
ggaggtaaag atcgaggggt tctacatgcg catggcgctg aaggtgctta ccttgagcga 11880 
cgacctgggc gtttatcgca acgagcgcat ccacaaggcc gtgagcgtga gccggcggcg 11940 
cgagctcagc gaccgcgagc tgatgcacag cctgcaaagg gccctggctg gcacgggcag 12000 
cggcgataga gaggccgagt cctactttga cgcgggcgct gacctgcgct gggccccaag 12060 
ccgacgcgcc ctggaggcag ctggggccgg acctgggctg gcggtggcac ccgcgcgcgc 12120 
tggcaacgtc ggcggcgtgg aggaatatga cgaggacgat gagtacgagc cagaggacgg 12180 
cgagtactaa gcggtgatgt ttctgatcag atgatgcaag acgcaacgga cccggcggtg 12240 
cgggcggcgc tgcagagcca gccgtccggc cttaactcca cggacgactg gcgccaggtc 12300 
atggaccgca tcatgtcgct gactgcgcgc aatcctgacg cgttccggca gcagccgcag 12360 
gccaaccggc tctccgcaat tccggaagcg gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gagaaggtgc tggcgatcgt aaacgcgctg gccgaaaaca gggccatccg gcccgacgag 12480 
gccggcctgg tctacgacgc gctgcttcag cgcgtggctc gttacaacag cggcaacgtg 12540 
cagaccaacc tggaccggct ggtgggggat gtgcgcgagg ccgtggcgca gcgtgagcgc 12 600 
gcgcagcagc agggcaacct gggctccatg gttgcactaa acgccttcct gagtacacag 12660 
cccgccaacg tgccgcgggg acaggaggac tacaccaact ttgtgagcgc actgcggcta 12720 
atggtgactg agacaccgca aagtgaggtg taccagtctg ggccagacta ttttttccag 12780 
accagtagac aaggcctgca gaccgtaaac ctgagccagg ctttcaaaaa cttgcagggg 12840 
ctgtgggggg tgcgggctcc cacaggcgac cgcgcgaccg tgtctagctt gctgacgccc 12900 
aactcgcgcc tgttgctgct gctaatagcg cccttcacgg acagtggcag cgtgtcccgg 12960 
gacacatacc taggtcactt gctgacactg taccgcgagg ccataggtca ggcgcatgtg 13020 
gacgagcata ctttccagga gattacaagt gtcagccgcg cgctggggca ggaggacacg 13080 
ggcagcctgg aggcaaccct aaactacctg ctgaccaacc ggcggcagaa gatcccctcg 13140 
ttgcacagtt taaacagcga ggaggagcgc attttgcgct acgtgcagca gagcgtgagc 13200 
cttaacctga tgcgcgacgg ggtaacgccc agcgtggcgc tggacatgac cgcgcgcaac 13260 
atggaaccgg gcatgtatgc ctcaaaccgg ccgtttatca accgcctaat ggactacttg 13320 
catcgcgcgg ccgccgtgaa ccccgagtat ttcaccaatg ccatcttgaa cccgcactgg 13380 
ctaccgcccc ctggtttcta caccggggga ttcgaggtgc ccgagggtaa cgatggattc 13440 
ctctgggacg acatagacga cagcgtgttt tccccgcaac cgcagaccct gctagagttg 13500 
caacagcgcg agcaggcaga ggcggcgctg cgaaaggaaa gcttccgcag gccaagcagc 13560 
ttgtccgatc taggcgctgc ggccccgcgg tcagatgcta gtagcccatt tccaagcttg 13620 
atagggtctc ttaccagcac tcgcaccacc cgcccgcgcc tgctgggcga ggaggagtac 13680 
ctaaacaact cgctgctgca gccgcagcgc gaaaaaaacc tgcctccggc atttcccaac 13740 
aacgggatag agagcctagt ggacaagatg agtagatgga agacgtacgc gcaggagcac 13800 
agggacgtgc caggcccgcg cccgcccacc cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
ctggtgtggg aggacgatga ctcggcagac gacagcagcg tcctggattt gggagggagt 13920 
ggcaacccgt ttgcgcacct tcgccccagg ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atgatgcaaa ataaaaaact caccaaggcc atggcaccga gcgttggttt tcttgtattc 14040 
cccttagtat gcggcgcgcg gcgatgtatg aggaaggtcc tcctccctcc tacgagagtg 14100 
tggtgagcgc ggcgccagtg gcggcggcgc tgggttctcc cttcgatgct cccctggacc 14160 
cgccgtttgt gcctccgcgg tacctgcggc ctaccggggg gagaaacagc atccgttact 14220 
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ctgagttggc acccctattc gacaccaccc gtgtgtacct ggtggacaac aagtcaacgg 14280 
atgtggcatc cctgaactac cagaacgacc acagcaactt tctgaccacg gtcattcaaa 14340 
acaatgacta cagcccgggg gaggcaagca cacagaccat caatcttgac gaccggtcgc 14400 
actggggcgg cgacctgaaa accatcctgc ataccaacat gccaaatgtg aacgagttca 14460 
tgtttaccaa taagtttaag gcgcgggtga tggtgtcgcg cttgcctact aaggacaatc 14520 
aggtggagct gaaatacgag tgggtggagt tcacgctgcc cgagggcaac tactccgaga 14580 
' ccatgaccat agaccttatg aacaacgcga tcgtggagca ctacttgaaa gtgggcagac 14640 

agaacggggt tctggaaagc gacatcgggg taaagtttga cacccgcaac ttcagactgg 14700 
ggtttgaccc cgtcactggt cttgtcatgc ctggggtata tacaaacgaa gccttccatc 14760 
cagacatcat tttgctgcca ggatgcgggg tggacttcac ccacagccgc ctgagcaact 14 820 
tgttgggcat ccgcaagcgg caacccttcc aggagggctt taggatcacc tacgatgatc 14880 
tggagggtgg taacattccc gcactgttgg atgtggacgc ctaccaggcg agcttgaaag 14940 
atgacaccga acagggcggg ggtggcgcag gcggcagcaa cagcagtggc agcggcgcgg 15000 
aagagaactc caacgcggca gccgcggcaa tgcagccggt ggaggacatg aacgatcatg 15060 
ccattcgcgg cgacaccttt gccacacggg ctgaggagaa gcgcgctgag gccgaagcag 15120 
cggccgaagc tgccgccccc gctgcgcaac ccgaggtcga gaagcctcag aagaaaccgg 15180 
tgatcaaacc cctgacagag gacagcaaga aacgcagtta caacctaata agcaatgaca 15240 
gcaccttcac ccagtaccgc agctggtacc ttgcatacaa ctacggcgac cctcagaccg 15300 
gaatccgctc atggaccctg ctttgcactc ctgacgtaac ctgcggctcg gagcaggtct 15360 
actggtcgtt gccagacatg atgcaagacc ccgtgacctt ccgctccacg cgccagatca 15420 
gcaactttcc ggtggtgggc gccgagctgt tgcccgtgca ctccaagagc ttctacaacg 15480 
accaggccgt ctactcccaa ctcatccgcc agtttacctc tctgacccac gtgttcaatc 15540 
gctttcccga gaaccagatt ttggcgcgcc cgccagcccc caccatcacc accgtcagtg 15600 
aaaacgttcc tgctctcaca gatcacggga cgctaccgct gcgcaacagc atcggaggag 15660 
tccagcgagt gaccattact gacgccagac gccgcacctg cccctacgtt tacaaggccc 15720 
tgggcatagt ctcgccgcgc gtcctatcga gccgcacttt ttgagcaagc atgtccatcc 15780 
ttatatcgcc cagcaataac acaggctggg gcctgcgctt cccaagcaag atgtttggcg 15840 
gggccaagaa gcgctccgac caacacccag tgcgcgtgcg cgggcactac cgcgcgccct 15900 
ggggcgcgca caaacgcggc cgcactgggc gcaccaccgt cgatgacgcc atcgacgcgg 15960 
tggtggagga ggcgcgcaac tacacgccca cgccgccacc agtgtccaca gtggacgcgg 16020 
ccattcagac cgtggtgcgc ggagcccggc gctatgctaa aatgaagaga cggcggaggc 16080 
gcgtagcacg tcgccaccgc cgccgacccg gcactgccgc ccaacgcgcg gcggcggccc 16140 
tgcttaaccg cgcacgtcgc accggccgac gggcggccat gcgggccgct cgaaggctgg 16200 
ccgcgggtat tgtcactgtg ccccccaggt ccaggcgacg agcggccgcc gcagcagccg 16260 
cggccattag tgctatgact cagggtcgca ggggcaacgt gtattgggtg cgcgactcgg 16320 
ttagcggcct gcgcgtgccc gtgcgcaccc gccccccgcg caactagatt gcaagaaaaa 16380 
actacttaga ctcgtactgt tgtatgtatc cagcggcggc ggcgcgcaac gaagctatgt 16440 
ccaagcgcaa aatcaaagaa gagatgctcc aggtcatcgc gccggagatc tatggccccc 16500 
cgaagaagga agagcaggat tacaagcccc gaaagctaaa gcgggtcaaa aagaaaaaga 16560 
aagatgatga tgatgaactt gacgacgagg tggaactgct gcacgctacc gcgcccaggc 16620 
gacgggtaca gtggaaaggt cgacgcgtaa aacgtgtttt gcgacccggc accaccgtag 16680 
tctttacgcc cggtgagcgc tccacccgca cctacaagcg cgtgtatgat gaggtgtacg 16740 
gcgacgagga cctgcttgag caggccaacg agcgcctcgg ggagtttgcc tacggaaagc 16800 
ggcataagga catgctggcg ttgccgctgg acgagggcaa cccaacacct agcctaaagc 16860 
ccgtaacact gcagcaggtg ctgcccgcgc ttgcaccgtc cgaagaaaag cgcggcctaa 16920 
agcgcgagtc tggtgacttg gcacccaccg tgcagctgat ggtacccaag cgccagcgac 16980 
tggaagatgt cttggaaaaa atgaccgtgg aacctgggct ggagcccgag gtccgcgtgc 1704 0 
ggccaatcaa gcaggtggcg ccgggactgg gcgtgcagac cgtggacgtt cagataccca 17100 
ctaccagtag caccagtatt gccaccgcca cagagggcat ggagacacaa acgtccccgg 17160 
ttgcctcagc ggtggcggat gccgcggtgc aggcggtcgc tgcggccgcg tccaagacct 17220 
ctacggaggt gcaaacggac ccgtggatgt ttcgcgtttc agccccccgg cgcccgcgcg 17280 
gttcgaggaa gtacggcgcc gccagcgcgc tactgcccga atatgcccta catccttcca 17340 
ttgcgcctac ccccggctat cgtggctaca cctaccgccc cagaagacga gcaactaccc 17400 
gacgccgaac caccactgga acccgccgcc gccgtcgccg tcgccagccc gtgctggccc 17460 
cgatttccgt gcgcagggtg gctcgcgaag gaggcaggac cctggtgctg ccaacagcgc 17520 
gctaccaccc cagcatcgtt taaaagccgg tctttgtggt tcttgcagat atggccctca 17580 
cctgccgcct ccgtttcccg gtgccgggat tccgaggaag aatgcaccgt aggaggggca 17640 
tggccggcca cggcctgacg ggcggcatgc gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 
cgcaccgtcg catgcgcggc ggtatcctgc ccctccttat tccactgatc gccgcggcga 17760 
ttggcgccgt gcccggaatt gcatccgtgg ccttgcaggc gcagagacac tgattaaaaa 17820 
caagttgcat gtggaaaaat caaaataaaa agtctggact ctcacgctcg cttggtcctg 17880 
taactatttt gtagaatgga agacatcaac tttgcgtctc tggccccgcg acacggctcg 17940 
cgcccgttca tgggaaactg gcaagatatc ggcaccagca atatgagcgg tggcgccttc 18000 
agctggggct cgctgtggag cggcattaaa aatttcggtt ccaccgttaa gaactatggc 18060 
agcaaggcct ggaacagcag cacaggccag atgctgaggg ataagttgaa agagcaaaat 18120 
ttccaacaaa aggtggtaga tggcctggcc tctggcatta gcggggtggt ggacctggcc 18180 



wo 01/04282 



6 



PCT/USOO/18971 



aaccaggcag tgcaaaataa gattaacagt 

cctccaccgg ccgtggagac agtgtctcca 

gacagggaag aaactctggt gacgcaaata 

aagcaaggcc tgcccaccac ccgtcccatc 

cacacacccg taacgctgga cctgcctccc 

ccaggcccga ccgccgttgt tgtaacccgt 

agcggtccgc gatcgttgcg gcccgtagcc 

atcgtgggtc tgggggtgca atccctgaag 

tcgtatgtgt gtcatgtatg cgtccatgtc 

gcccgctttc caagatggct accccttcga 

cgggccagga cgcctcggag tacctgagcc 

agacgtactt cagcctgaat aacaagttta 

tgaccacaga ccggtcccag cgtttgacgc 

ctgcgtactc gtacaaggcg cggttcaccc 

tggcttccac gtactttgac atccgcggcg 

actctggcac tgcctacaac gccctggctc 

atgaagctgc tactgctctt gaaataaacc 

aagtagacga gcaagctgag cagcaaaaaa 

gtataaatat tacaaaggag ggtattcaaa 

ccgataaaac atttcaacct gaacctcaaa 

ttaatcatgc agctgggaga gtccttaaaa 

catatgcaaa acccacaaat gaaaatggag 

gaaagctaga aagtcaagtg gaaatgcaat 

atggtgataa cttgactcct aaagtggtat 

cagacactca tatttcttac atgcccacta 

gccaacaatc tatgcccaac aggcctaatt 

taatgtatta caacagcacg ggtaatatgg 

atgctgttgt agatttgcaa gacagaaaca 

ccattggtga tagaaccagg tacttttcta 

cagatgttag aattattgaa aatcatggaa 

cactgggagg tgtgattaat acagagactc 

aaaatggatg ggaaaaagat gctacagaat 

ataattttgc catggaaatc aatctaaatg 

acatagcgct gtatttgccc gacaagctaa 

ataacccaaa cacctacgac tacatgaaca 

gctacattaa ccttggagca cgctggtccc 

accaccaccg caatgctggc ctgcgctacc 

tgcccttcca catccaggtg cctcagaagt 

cgggctcata cacctacgag tggaacttca 

ccctaggaaa tgacctaagg gttgacggag 

acgccacctt cttccccatg gcccacaaca 

acgacaccaa cgaccagtcc tttaacgact 

tacccgccaa cgctaccaac gtgcccatat 

gcggctgggc cttcacgcgc cttaagacta 

acccttatta cacctactct ggctctatac 

acacctttaa gaaggtggcc attacctttg 

gcctgcttac ccccaacgag tttgaaatta 

ttgcccagtg taacatgacc aaagactggt 

ttggctacca gggcttctat atcccagaga 

gaaacttcca gcccatgagc cgtcaggtgg 

aggtgggcat cctacaccaa cacaacaact 

tgcgcgaagg acaggcctac cctgctaact 

ttgacagcat tacccagaaa aagtttcttt 

ccagtaactt tatgtccatg ggcgcactca 

actccgccca cgcgctagac atgacttttg 

tttatgtttt gtttgaagtc tttgacgtgg 

tcgaaaccgt gtacctgcgc acgcccttct 

agcaacatca acaacagctg ccgccatggg 

caaagatctt ggttgtgggc catatttttt 

tgtttctcca cacaagctcg cctgcgccat 

cgtacactgg atggcctttg cctggaaccc 

ctttggcttt tctgaccagc gactcaagca 

gcgccgtagc gccattgctt cttcccccga 

aagcgtacag gggcccaact cggccgcctg 

ctttgccaac tggccccaaa ctcccatgga 

ggtacccaac tccatgctca acagtcccca 



aagcttgatc cccgccctcc cgtagaggag 1824 0 
gaggggcgtg gcgaaaagcg tccgcgcccc 18300 
gacgagcctc cctcgtacga ggaggcacta 18360 
gcgcccatgg ctaccg^gagt gctgggccag 18420 
cccgccgaca cccagcagaa acctgtgctg 18480 
cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agtggcaact ggcaaagcac actgaacagc 18600 
cgccgacgat gcttctgaat agctaacgtg 18660 
gccgccagag gagctgctga gccgccgcgc 18720 
tgatgccgca gtggtcttac atgcacatct 18780 
ccgggctggt gcagtttgcc cgcgccaccg 18840 
gaaaccccac ggtggcgcct acgcacgacg 18900 
tgcggttcat ccctgtggac cgtgaggata 18960 
tagctgtggg tgataaccgt gtgctggaca 19020 
tgctggacag gggccctact tttaagccct 19080 
ccaagggtgc cccaaatcct tgcgaatggg 19140 
tagaagaaga ggacgatgac aacgaagacg 19200 
ctcacgtatt tgggcaggcg ccttattctg 19260 
taggtgtcga aggtcaaaca cctaaatatg 19320 
taggagaatc tcagtggtac gaaactgaaa 19380 
agactacccc aatgaaacca tgttacggtt 194 40 
ggcaaggcat tcttgtaaag caacaaaatg 19500 
ttttctcaac tactgaggcg accgcaggca 19560 
tgtacagtga agatgtagat atagaaaccc 19620 
ttaaggaagg taactcacga gaactaatgg 19680 
acattgcttt tagggacaat tttattggtc 197 40 
gtgttctggc gggccaagca tcgcagttga 19800 
cagagctttc ataccagctt ttgcttgatt 198 60 
tgtggaatca ggctgttgac agctatgatc 19920 
ctgaagatga acttccaaat tactgctttc 19980 
ttaccaaggt aaaacctaaa acaggtcagg 20040 
tttcagataa aaatgaaata agagttggaa 20100 
ccaacctgtg gagaaatttc ctgtactcca 20160 
agtacagtcc ttccaacgta aaaatttctg 20220 
agcgagtggt ggctcccggg ttagtggact 20280 
ttgactatat ggacaacgtc aacccattta 20340 
gctcaatgtt gctgggcaat ggtcgctatg 20400 
tctttgccat taaaaacctc cttctcctgc 204 60 
ggaaggatgt taacatggtt ctgcagagct 20520 
ccagcattaa gtttgatagc atttgccttt 20580 
ccgcctccac gcttgaggcc atgcttagaa 20640 
atctctccgc cgccaacatg ctctacccta 20700 
ccatcccctc ccgcaactgg gcggctttcc 20760 
aggaaacccc atcactgggc tcgggctacg 20820 
cctacctaga tggaaccttt tacctcaacc 20880 
actcttctgt cagctggcct ggcaatgacc 20940 
agcgctcagt tgacggggag ggttacaacg 21000 
tcctggtaca aatgctagct aactacaaca 21060 
gctacaagga ccgcatgtac tccttcttta 21120 
tggatgatac taaatacaag gactaccaac 21180 
ctggatttgt tggctacctt gcccccacca 21240 
tcccctatcc gcttataggc aagaccgcag 21300 
gcgatcgcac cctttggcgc atcccattct 21360 
cagacctggg ccaaaacctt ctctacgcca 21420 
aggtggatcc catggacgag cccacccttc 21480 
tccgtgtgca ccggccgcac cgcggcgtca 21540 
cggccggcaa cgccacaaca taaagaagca 21600 
ctccagtgag caggaactga aagccattgt 21660 
gggcacctat gacaagcgct ttccaggctt 21720 
agtcaatacg gccggtcgcg agactggggg 21780 
gcactcaaaa acatgctacc tctttgagcc 21840 
ggtttaccag tttgagtacg agtcactcct 21900 
ccgctgtata acgctggaaa agtccaccca 21960 
tggactattc tgctgcatgt ttctccacgc 22020 
tcacaacccc accatgaacc ttattaccgg 22080 
ggtacagccc accctgcgtc gcaaccagga 22140 



wo 01/04282 



7 



PCTAJSOO/18971 



acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat 22200 
taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctagagacac 22260 
tttcaataaa ggcaaatgct tttatttgta cactctcggg tgattattta cccccaccct 22320 
tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc gcatcgctat gcgccactgg 22380 
cagggacacg ttgcgatact ggtgtttagt gctccactta aactcaggca caaccatccg 22440 
cggcagctcg gtgaagtttt cactccacag gctgcgcacc atcaccaacg cgtttagcag 22500 
gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg ccctgcgcgc gcgagttgcg 22560 
atacacaggg ttgcagcact ggaacactat cagcgccggg tggtgcacgc tggccagcac 22620 
gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg ttgctcaggg cgaacggagt 22680 
caactttggt agctgccttc ccaaaaaggg cgcgtgccca ggctttgagt tgcactcgca 22740 
ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg ttaggataca gcgcctgcat 22800 
aaaagccttg atctgcttaa aagccacctg agcctttgcg ccttcagaga agaacatgcc 22860 
gcaagacttg ccggaaaact gattggccgg acaggccgcg tcgtgcacgc agcaccttgc 22920 
gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22980 
gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 
gtgctcctta tttatcataa tgcttccgtg tagacactta agctcgcctt cgatctcagc 23100 
gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 
aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 
ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 
cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 
gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 23400 
cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 23460 
ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 
cgccacatct tctctttctt cctcgctgtc cacgattacc tctggtgatg gcgggcgctc 23640 
gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 23700 
ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 23760 
gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 
cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 
gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 
gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24000 
cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 
cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 
cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 24180 
cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 
cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24300 
cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 24360 
accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24 420 
cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 
ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 2454 0 
gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24600 
gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24 660 
tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 
aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24780 
catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 2484 0 
aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24 900 
ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24 960 
agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 
gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 
caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 2514 0 
ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 
ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 
gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 
ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 
ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 254 4 0 
gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 
tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 
cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 
ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 25740 
gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 
gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 
gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 
gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 
aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 26040 
gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 
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agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 
aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 26220 
aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 26280 
cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 26340 
cgccgccggc actgcccgtt cgccgaccca accgtagatg ggacaccact ggaaccaggg 26400 
ccggtaagtc caagcagccg ccgccgttag cccaagagca acaacagcgc caaggctacc 26460 
gctcatggcg cgggcacaag aacgccatag ttgcttgctt gcaagactgt gggggcaaca 26520 
tctccttcgc ccgccgcttt cttctctacc atcacggcgt ggccttcccc cgtaacatcc 26580 
tgcattacta ccgtcatctc tacagcccat actgcaccgg cggcagcggc agcggcagca 26640 
acagcagcgg ccacacagaa gcaaaggcga ccggatagca agactctgac aaagcccaag 26700 
aaatccacag cggcggcagc agcaggagga ggagcgctgc gtctggcgcc caacgaaccc 26760 
gtatcgaccc gcgagcttag aaacaggatt tttcccactc tgtatgctat atttcaacag 26820 
agcaggggcc aagaacaaga gctgaaaata aaaaacaggt ctctgcgatc cctcacccgc 26880 
agctgcctgt atcacaaaag cgaagatcag cttcggcgca cgctggaaga cgcggaggct 26940 
ctcttcagta aatactgcgc gctgactctt aaggactagt ttcgcgccct ttctcaaatt 27000 
taagcgcgaa aactacgtca tctccagcgg ccacacccgg cgccagcacc tgtcgtcagc 27060 
gccattatga gcaaggaaat tcccacgccc tacatgtgga gttaccagcc acaaatggga 27120 
cttgcggctg gagctgccca agactactca acccgaataa actacatgag cgcgggaccc 27180 
cacatgatat cccgggtcaa cggaatccgc gcccaccgaa accgaattct cttggaacag 2724 0 
gcggctatta ccaccacacc tcgtaataac cttaatcccc gtagttggcc cgctgccctg 27300 
gtgtaccagg aaagtcccgc tcccaccact gtggtacttc ccagagacgc ccaggccgaa 27360 
gttcagatga ctaactcagg ggcgcagctt gcgggcggct ttcgtcacag ggtgcggtcg 27420 
cccgggcagg gtataactca cctgacaatc agagggcgag gtattcagct caacgacgag 27 480 
tcggtgagct cctcgcttgg tctccgtccg gacgggacat ttcagatcgg cggcgccggc 27540 
cgtccttcat tcacgcctcg tcaggcaatc ctaactctgc agacctcgtc ctctgagccg 27 600 
cgctctggag gcattggaac tctgcaattt attgaggagt ttgtgccatc ggtctacttt 27 660 
aaccccttct cgggacctcc cggccactat ccggatcaat ttattcctaa ctttgacgcg 27720 
gtaaaggact cggcggacgg ctacgactga taattaagtg gagaggcaga gcaactgcgc 27780 
ctgaaacacc tggtccactg tcgccgccac aagtgctttg cccgcgactc cggtgagttt 27840 
tgctactttg aattgcccga ggatcatatc gaggatcttt gttgccatct ctgtgctgag 27 900 
tataataaat acagaaatta aaatatactg gggctcctat cgccatcctg taaacgccac 27 960 
cgtcttcacc cgcccaagca aaccaaggcg aaccttacct ggtactttta acatctctcc 28020 
ctctgtgatt tacaacagtt tcaacccaga cggagtgagt ctacgagaga acctctccga 28080 
gctcagctac tccatcagaa aaaacaccac cctccttacc tgccgggaac gtacccttaa 28140 
ttaaaagtca ggcttcctgg atgtcagcat ctgactttgg ccagcacctg tcccgcggat 28200 
ttgttccagt ccaactacag cgacccaccc taacagagat gaccaacaca accaacgcgg 28260 
ccgccgctac cggacttaca tctaccacaa atacacccca agtttctgcc tttgtcaata 28320 
actgggataa cttgggcatg tggtggttct ccatagcgct tatgtttgta tgccttatta 28380 
ttatgtggct catctgctgc ctaaagcgca aacgcgcccg accacccatc tatagtccca 28440 
tcattgtgct acacccaaac aatgatggaa tccatagatt ggacggactg aaacacatgt 28500 
tcttttctct tacagtatga ttaaatgaga ttaattaagg aatttctgtc cagtttattc 28560 
agcagcacct ccttgccctc ctcccagctc tggtattgca gcttcctcct ggctgcaaac 28620 
tttctccaca atctaaatgg aatgtcagtt tcctcctgtt cctgtccatc cgcacccact 28680 
atcttcatgt tgttgcagat gaagcgcgca agaccgtctg aagatacctt caaccccgtg 28740 
tatccatatg acacggaaac cggtcctcca actgtgcctt ttcttactcc tccctttgta 28800 
tcccccaatg ggtttcaaga gagtccccct ggggtactct ctttgcgcct atccgaacct 28860 
ctagttacct ccaatggcat gcttgcgctc aaaatgggca acggcctctc tctggacgag 28920 
gccggcaacc ttacctccca aaatgtaacc actgtgagcc cacctctcaa aaaaaccaag 28980 
tcaaacataa acctggaaat atctgcaccc ctcacagtta cctcagaagc cctaactgtg 29040 
gctgccgccg cacctctaat ggtcgcgggc aacacactca ccatgcaatc acaggccccg 29100 
ctaaccgtgc acgactccaa acttagcatt gccacccaag gacccctcac agtgtcagaa 29160 
ggaaagctag ccctgcaaac atcaggcccc ctcaccacca ccgatagcag tacccttact 29220 
atcactgcct caccccctct aactactgcc actggtagct tgggcattga cttgaaagag 29280 
cccatttata cacaaaatgg aaaactagga ctaaagtacg gggctccttt gcatgtaaca 29340 
gacgacctaa acactttgac cgtagcaact ggtccaggtg tgactattaa taatacttcc 29400 
ttgcaaacta aagttactgg agccttgggt tttgattcac aaggcaatat gcaacttaat 2 94 60 
gtagcaggag gactaaggat tgattctcaa aacagacgcc ttatacttga tgttagttat 2 9520 
ccgtttgatg ctcaaaacca actaaatcta agactaggac agggccctct ttttataaac 2 9580 
tcagcccaca acttggatat taactacaac aaaggccttt acttgtttac agcttcaaac 29640 
aattccaaaa agcttgaggt taacctaagc actgccaagg ggttgatgtt tgacgctaca 29700 
gccatagcca ttaatgcagg agatgggctt gaatttggtt cacctaatgc accaaacaca 297 60 
aatcccctca aaacaaaaat tggccatggc ctagaatttg attcaaacaa ggctatggtt 29820 
cctaaactag gaactggcct tagttttgac agcacaggtg ccattacagt aggaaacaaa 29880 
aataatgata agctaacttt gtggaccaca ccagctccat ctcctaactg tagactaaat 29940 
gcagagaaag atgctaaact cactttggtc ttaacaaaat gtggcagtca aatacttgct 30000 
acagtttcag ttttggctgt taaaggcagt ttggctccaa tatctggaac agttcaaagt 30060 
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gctcatctta 
ccagaatatt 
qttggattta 
aacattgtca 
acactaaacg 
tcatgggact 
ttttcataca 
tcaattgcag 
cttatacaga 
tccctcccaa 
atcatgggta 
acgctcatca 
cagctgctga 
agtccacgcc 
cagcagcgcg 
agtggtctcc 
acagcagcgc 
attgttcaaa 
acccacgtgg 
gctggacata 
aaacctctga 
cccgccggct 
ctcgtaacca 
gtgcatacac 
aacccattcc 
gttgtgcatt 
gcgggtttct 
ccgagatcgt 
tgaagcaaaa 
atcgctctgt 
cttcgggttc 
aataagccac 
gaagagctgg 
tgaagatcta 
gaacagataa 
acgtccaagt 
gcaccttcaa 
aaatcccgaa 
agcctcaagc 
tcaaaagcgg 
gaacataatc 
caaaagaacc 
cgatgtaagc 
ggcaaagcct 
gtaagctccg 
ttctgcataa 
caacaggaaa 
aaaaaaactg 
tcataatgta 
ccgaaatagc 
ggaggtataa 
ctaggcaaaa 
aacagtcagc 
ccagctcaat 
aaaaaatgac 
cgcccagaaa 
acgttacgta 
ccctaaaacc 
ccctcattat 



ttataagatt 
ggaactttag 
tgcctaacct 
gtcaagttta 
gtacacagga 
ggtctggcca 
ttgcccaaga 
aaaatttcaa 
tcaccgtacc 
cacacagagt 
acagacatat 
gtgatattaa 
gccacaggct 
tacatggggg 
cgaataaact 
tcagcgatga 
accctgatct 
atcccacagt 
ccatcatacc 
aacattacct 
ttaaacatgg 
atacactgca 
tggatcatca 
ttcctcagga 
tgaatcagcg 
gtcaaagtgt 
gtctcaaaag 
gttggtcgta 
ccaggtgcgg 
gtagtagttg 
tatgtaaact 
acccagccaa 
aagaaccatg 
ttaagtgaac 
tggcatttgt 
ggacgtaaag 
ccatgcccaa 
tattaagtcc 
agcgaatcat 
aacattaaca 
gtgcaggtct 
cacactgatt 
tttgttgcat 
cgcgcaaaaa 
gaaccaccac 
acacaaaata 
aacaaccctt 
gtcaccgtga 
agactcggta 
ccgggggaat 
caaaattaat 
tagcaccctc 
cttaccagta 
cagtcacagt 
gtaacggtta 
cgaaagccaa 
acttcccatt 
tacgtcaccc 
catattggct 



tgacgaaaat 
aaatggagat 
atcagcttat 
cttaaacgga 
aacaggagac 
caactacatt 
ataaagaatc 
gtcatttttc 
ttaatcaaac 
aicacagtcct 
tcttaggtgt 
taaactcccc 
gctgtccaac 
tagagtcata 
gctgccgccg 
ttcgcaccgc 
cacttaaatc 
gcaaggcgct 
acaagcgcag 
cttttggcat 
cgccatccac 
gggaaccggg 
tgctcgtcat 
ttacaagctc 
taaatcccac 
tacattcggg 
gaggtagacg 
gtgtcatgcc 
gcgtgacaaa 
tagtatatcc 
ccttcatgcg 
cctacacatt 
tttttttttt 
gcgctcccct 
aagatgttgc 
gctaaaccct 
ataattctca 
ggccattgta 
gattgcaaaa 
aaaataccgc 
gcacggacca 
atgacacgca 
gggcggcgat 
agaaagcaca 
agaaaaagac 
aaataacaaa 
ataagcataa 
ttaaaaagca 
aacacatcag 
acatacccgc 
aggagagaaa 
ccgctccaga 
aaaaagaaaa 
gtaaaaaagg 
aagtccacaa 
aaaacccaca 
ttaagaaaac 
gccccgttcc 
tcaatccaaa 



ggagtgctac 
cttactgaag 
ccaaaatctc 
gacaaaacta 
acaactccaa 
aatgaaatat 
gtttgtgtta 
attcagtagt 
tcacagaacc 
ttctccccgg 
tatattccac 
gggcagctca 
ttgcggttgc 
atcgtgcatc 
ccgctccgtc 
ccgcagcata 
agcacagtaa 
gtatccaaag 
gtagattaag 
gttgtaattc 
caccatccta 
actggaacaa 
gatatcaatg 
ctcccgcgtt 
actgcaggga 
cagcagcgga 
atccctactg 
aaatggaacg 
cagatctgcg 
actctctcaa 
ccgctgccct 
cgttctgcga 
tattccaaaa 
ccggtggcgt 
acaatggctt 
tcagggtgaa 
tctcgccacc 
aaaatctgct 
attcaggttc 
gatcccgtag 
gcgcggccac 
tactcggagc 
ataaaatgca 
tcgtagtcat 
accatttttc 
aaaacattta 
gacggactac 
ccaccgacag 
gttgattcat 
aggcgtagag 
aacacataaa 
acaacataca 
cctattaaaa 
gccaagtgca 
aaaacaccca 
acttcctcaa 
tacaattccc 
cacgccccgc 
ataaggtata 



taaacaattc 
gcacagccta 
acggtaaaac 
aacctgtaac 
gtgcatactc 
ttgccacatc 
tgtttcaacg 
atagccccac 
ctagtattca 
ctggccttaa 
acggtttcct 
cttaagttca 
ttaacgggcg 
aggatagggc 
ctgcaggaat 
aggcgccttg 
ctgcagcaca 
ctcatggcgg 
tggcgacccc 
accacctccc 
aaccagctgg 
tgacagtgga 
ttggcacaac 
agaaccatat 
agacctcgca 
tgatcctcca 
tacggagtgc 
ccggacgtag 
tctccggtct 
agcatccagg 
gataacatcc 
gtcacacacg 
gattatccaa 
ggtcaaactc 
ccaaaaggca 
tctcctctat 
ttctcaatat 
ccagagcgcc 
ctcacagacc 
gtcccttcgc 
ttccccgcca 
tatgctaacc 
aggtgctgct 
gctcatgcag 
tctcaaacat 
aacattagaa 
ggccatgccg 
ctcctcggtc 
cggtcagtgc 
acaacattac 
cacctgaaaa 
gcgcttcaca 
aaacaccact 
gagcgagtat 
gaaaaccgca 
atcgtcactt 
aacacataca 
gccacgtcac 
ttattgatga 



cttcctggac 
tacaaacgct 
tgccaaaagt 
actaaccatt 
tatgtcattt 
ctcttacact 
tgtttatttt 
caccacatag 
acctgccacc 
aaagcatcat 
gtcgagccaa 
tgtcgctgtc 
gcgaaggaga 
ggtggtgctg 
acaacatggc 
tcctccgggc 
gcaccacaat 
ggaccacaga 
tcataaacac 
ggtaccatat 
ccaaaacctg 
gagcccagga 
acaggcacac 
cccagggaac 
cgtaactcac 
gtatggtagc 
gccgagacaa 
tcatatttcc 
cgccgcttag 
cgccccctgg 
accaccgcag 
ggaggagcgg 
aacctcaaaa 
tacagccaaa 
aacggccctc 
aaacattcca 
atctctaagc 
ctccaccttc 
tgtataagat 
agggccagct 
ggaaccttga 
agcgtagccc 
caaaaaatca 
ataaaggcag 
gtctgcgggt 
gcctgtctta 
gcgtgaccgt 
atgtccggag 
taaaaagcga 
agcccccata 
accctcctgc 
gcggcagcct 
cgacacggca 
atataggact 
cgcgaaccta 
ccgttttccc 
agttactccg 
aaactccacc 
tg 



30120 
30180 
30240 
30300 
30360 
30420 
30480 
30540 
30600 
30660 
30720 
30780 
30840 
30900 
30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32520 
32580 
32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240 
33300 
33360 
33420 
33480 
33540 
33592 



<210> 2 
<211> 34341 
<212> DNA 

<213> Adenovirus subgroup C 



<400> 2 
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catcatcaac aatatacctt attttggatt 
ttgtgacgtg gcgcggggcg tgggaacggg 
gatgttgcaa gtgtggcgga acacatgtaa 
gtgtgcgccg gtgtacacag gaagtgacaa 
taaatttggg cgtaaccgag taagatttgg 
agtgaaatct gaataatttt gtgttactca 
gactttgacc gtttacgtgg agactcgccc 
cgggtcaaag ttggcgtttt attattatag 
tgagttcctc aagaggccac tcttgagtgc 
tccgacaccg ggactgaaaa tgagacatga 
ccattttgaa ccacctaccc ttcacgaact 
tcccaacgag gaggcggttt cgcagatttt 
agggattgac ttactcactt ttccgccggc 
ccggcagccc gagcagccgg agcagagagc 
tccacccagt gacgacgagg atgaagaggg 
ccccgggcac ggttgcaggt cttgtcatta 
tatgtgttcg ctttgctata tgaggacctg 
atgggcagtg ggtgatagag tggtgggttt 
gttttgtggt ttaaagaatt ttgtattgtg 
gagcctgagc ccgagccaga accggagcct 
cctgctatcc tgagacgccc gacatcacct 
agctgtgact ccggtccttc taacacacct 
cccattaaac cagttgccgt gagagttggt 
gacttgctta acgagcctgg gcaacctttg 
ggtgtaaacc tgtgattgcg tgtgtggtta 
agtttaataa agggtgagat aatgtttaac 
aaagggtata taatgcgccg tgggctaatc 
gagtgtttgg aagatttttc tgctgtgcgt 
tcttggtttt ggaggtttct gtggggctca 
gaggattaca agtgggaatt tgaagagctt 
ttgaatctgg gtcaccaggc gcttttccaa 
acaccggggc gcgctgcggc tgctgttgct 
gaagaaaccc atctgagcgg ggggtacctg 
gcggttgtga gacacaagaa tcgcctgcta 
ccgacggagg agcagcagca gcagcaggag 
ccatggaacc cgagagccgg cctggaccct 
tgtatccaga actgagacgc attttgacaa 
taaagaggga gcggggggct tgtgaggcta 
taatgaccag acaccgtcct gagtgtatta 
atgagcttga tctgctggcg cagaagtatt 
agccagggga tgattttgag gaggctatta 
attgcaagta caagatcagc aaacttgtaa 
acggggccga ggtggagata gatacggagg 
atatgtggcc gggggtgctt ggcatggacg 
gccccaattt tagcggtacg gttttcctgg 
gcttctatgg gtttaacaat acctgtgtgg 
gtgcctttta ctgctgctgg aagggggtgg 
agaaatgcct ctttgaaagg tgtaccttgg 
gccacaatgt ggcctccgac tgtggttgct 
agcataacat ggtatgtggc aactgcgagg 
acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata ctgacccgct 
tgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 



gaagccaata tgataatgag ggggtggagt 60 
gcgggtgacg tagtagtgtg gcggaagtgt 120 
gcgacggatg tggcaaaagt gacgtttttg 180 
ttttcgcgcg gttttaggcg gatgttgtag 240 
ccattttcgc gggaaaactg aataagagga 300 
tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 420 
tcagctgacg tgtagtgtat ttatacccgg 480 
cagcgagtag agttttctcc tccgagccgc 540 
ggtactggct gataatcttc cacctcctag 600 
gtatgattta gacgtgacgg cccccgaaga 660 
tcccgactct gtaatgttgg cggtgcagga 720 
gcccggttct ccggagccgc ctcacctttc 780 
cttgggtccg gtttgccacg aggctggctt 840 
tgaggagttt gtgttagatt atgtggagca 900 
tcaccggagg aatacggggg acccagatat 960 
tggcatgttt gtctacagta agtgaaaatt 1020 
ggtgtggtaa tttttttttt aatttttaca 1080 
atttttttaa aaggtcctgt gtctgaacct 1140 
gcaagaccta cccgccgtcc taaaatggcg 1200 
gtgtctagag aatgcaatag tagtacggat 1260 
cctgagatac acccggtggt cccgctgtgc 1320 
gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgagct gtaaacgccc caggccataa 1440 
acgcctttgt ttgctgaatg agttgatgta 1500 
ttgcatggcg tgttaaatgg ggcggggctt 1560 
ttggttacat ctgacctcat ggaggcttgg 1620 
aacttgctgg aacagagctc taacagtacc 1680 
tcccaggcaa agttagtctg cagaattaag 1740 
ttgaaatcct gtggtgagct gtttgattct 1800 
gagaaggtca tcaagacttt ggatttttcc 1860 
tttttgagtt ttataaagga taaatggagc 1920 
ctggattttc tggccatgca tctgtggaga 1980 
ctgttgtctt ccgtccgccc ggcgataata 2040 
gaagccaggc ggcggcggca ggagcagagc 2100 
cgggaatgaa tgttgtacag gtggctgaac 2160 
ttacagagga tgggcagggg ctaaaggggg 2220 
cagaggaggc taggaatcta gcttttagct 2280 
cttttcaaca gatcaaggat aattgcgcta 2340 
ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 24 60 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
gggtggttat tatgaatgta aggtttactg 2640 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 2760 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 2940 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgca tttgggtaac aggagggggg 3120 
acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 3240 
cctgcgagtg tggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 3540 
ccggggtgcg tcagaatgtg atgggctcca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
ttttggcaca attggattct ttgacccggg 3840 
atctgcgcca gcaggtttct gccctgaagg 3900 
taaataaaaa accagactct gtttggattt 3960 
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ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
rctggatgtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 
tgtttacaaa gcggttaagc tgggatgggt 
actgtatttt taggttggct atgttcccag 
gaaccaccag cacagtgtat ccggtgcact 
atgcgtggaa gaacttggag acgcccttgt 
taatgatggc aatgggccca cgggcggcgg 
cgtcatagtt gtgttccagg atgagatcgt 
gggtgccaga ctgcggtata atggttccat 
tttgcatttc ccacgctttg agttcagatg 
agaaaacggt ttccggggta ggggagatca 
gcgacttacc gcagccggtg ggcccgtaaa 
taagagagct gcagctgccg tcatccctga 
tgactcgcat gttttccctg accaaatccg 
gttcttgcaa ggaagcaaag tttttcaacg 
tgagcgtttg accaagcagt tccaggcggt 
ctcgatccag catatctcct cgtttcgcgg 
tcggtgctcg tccagacggg ccagggtcat 
cgtagtctgg gtcacggtga aggggtgcgc 
gaggctggtc ctgctggtgc tgaagcgctg 
gcatttgacc atggtgtcat agtccagccc 
gcccttggag gaggcgccgc acgaggggca 
cgcgagaaat accgattccg gggagtaggc 
gcattccacg agccaggtga gctctggccg 
ctttttgatg cgtttcttac ctctggtttc 
aaggctgtcc gtgtccccgt atacagactt 
gtcctcctcg tatagaaact cggaccactc 
gaaggaggct aagtgggagg ggtagcggtc 
ggtgtgaaga cacatgtcgc cctcttcggc 
ggccacgtga ccgggtgttc ctgaaggggg 
ctcactctct tccgcatcgc tgtctgcgag 
aaaagcgggc atgacttctg cgctaagatt 
attcacctgg cccgcggtga tgcctttgag 
aatctttttg ttgtcaagct tggtggcaaa 
ggcgatggag cgcagggttt ggtttttgtc 
tagctgcacg tattcgcgcg caacgcaccg 
gggcaccagg tgcacgcgcc aaccgcggtt 
tacctctccg cgtaggcgct cgttggtcca 
tggcggtagg gggtctagct gcgtctcgtc 
gggcagcagg cgcgcgtcga agtagtctat 
ccatgcgcgg gcggcaagcg cgcgctcgta 
gtgggtgagc gcggaggcgt acatgccgca 
tattccaaga tatgtagggt agcatcttcc 
tagttcgtgc gagggagcga ggaggtcggg 
tcggaagact atctgcctga agatggcatg 
gacgttgaag ctggcgtctg tgagacctac 
gcgcagcttg ttgaccagct cggcggtgac 
ttccttgatg atgtcatact tatcctgtcc 
aaactcttcg cggtctttcc agtactcttg 
agagcctagc atgtagaact ggttgacggc 
tagcgcgtat gcctgcgcgg ccttccggag 
gaccatgact ttgaggtact ggtatttgaa 
gagcaaaaag tccgtgcgct ttttggaacg 
gaagagtatc tttcccgcgc gaggcataaa 
ctcggaacgg ttgttaatta cctgggcggc 
gtggcccaca atgtaaagtt ccaagaagcg 
aagttcctcg taggtgagct cttcagggga 
tgcaagatga gggttggaag cgacgaatga 
caggtggtcg cgaaaggtcc taaactggcg 
gtagaaggta agcgggtctt gttcccagcg 
cgcggcagtc actagaggct catctccgcc 
ctgcttccca aaggccccca tccaagtata 
ctcggtgcga ggatgcgagc cgatcgggaa 



taggggtttt gcgcgcgcgg taggcccggg 4020 
gtattttttc caggacgtgg taaaggtgac 4080 
cgtctctggg gtggaggtag caccactgca 4140 
tgatccagtc gtagcaggag cgctgggcgt 4200 
tgattgccag gggcaggccc ttggtgtaag 4260 
gcatacgtgg ggatatgaga tgcatcttgg 4320 
ccatatccct ccggggattc atgttgtgca 4380 
tgggaaattt gtcatgtagc ttagaaggaa 44 4 0 
gacctccaag attttccatg cattcgtcca 4500 
cctgggcgaa gatatttctg ggatcactaa 4560 
cataggccat ttttacaaag cgcgggcgga 4620 
ccggcccagg ggcgtagtta ccctcacaga 4680 
gggggatcat gtctacctgc ggggcgatga 4740 
gctgggaaga aagcaggttc ctgagcagct 4800 
tcacacctat taccgggtgc aactggtagt 48 60 
gcaggggggc cacttcgtta agcatgtccc 4920 
ccagaaggcg ctcgccgccc agcgatagca 4980 
gtttgagacc gtccgccgta ggcatgcttt 504 0 
cccacagctc ggtcacctgc tctacggcat 5100 
gttggggcgg ctttcgctgt acggcagtag 5160 
gtctttccac gggcgcaggg tcctcgtcag 5220 
tccgggctgc gcgctggcca gggtgcgctt 5280 
ccggtcttcg ccctgcgcgt cggccaggta 534 0 
ctccgcggcg tggcccttgg cgcgcagctt 5400 
gtgcagactt ttgagggcgt agagcttggg 54 60 
atccgcgccg caggccccgc agacggtctc 552 0 
ttcggggtca aaaaccaggt ttcccccatg 5580 
catgagccgg tgtccacgct cggtgacgaa 564 0 
gagaggcctg tcctcgagcg gtgttccgcg 5700 
tgagacaaag gctcgcgtcc aggccagcac 57 60 
gttgtccact agggggtcca ctcgctccag 5820 
atcaaggaag gtgattggtt tgtaggtgta 5880 
gctataaaag ggggtggggg cgcgttcgtc 594 0 
ggccagctgt tggggtgagt actccctctg 6000 
gtcagtttcc aaaaacgagg aggatttgat 6060 
ggtggccgca tccatctggt cagaaaagac 6120 
cgacccgtag agggcgttgg acagcaactt 6180 
gcgatcggcg cgctccttgg ccgcgatgtt 6240 
ccattcggga aagacggtgg tgcgctcgtc 6300 
gtgcagggtg acaaggtcaa cgctggtggc 6360 
gcagaggcgg ccgcccttgc gcgagcagaa 6420 
cggggggtct gcgtccacgg taaagacccc 6480 
cttgcatcct tgcaagtcta gcgcctgctg 6540 
tgggttgagt gggggacccc atggcatggg 6600 
aatgtcgtaa acgtagaggg gctctctgag 6660 
accgcggatg ctggcgcgca cgtaatcgta 6720 
accgaggttg ctacgggcgg gctgctctgc 6780 
tgagttggat gatatggttg gacgctggaa 6840 
cgcgtcacgc acgaaggagg cgtaggagtc 6900 
ctgcacgtct agggcgcagt agtccagggt 6960 
cttttttttc cacagctcgc ggttgaggac 7020 
gatcggaaac ccgtcggcct ccgaacggta 7080 
ctggtaggcg cagcatccct tttctacggg 7140 
cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gtcagtgtcg tcgcatccgc cctgctccca 7260 
cggatttggc agggcgaagg tgacatcgtt 7320 
gttgcgtgtg atgcggaagg gtcccggcac 7380 
gagcacgatc tcgtcaaagc cgttgatgtt 7440 
cgggatgccc ttgatggaag gcaatttttt 7500 
gctgagcccg tgctctgaaa gggcccagtc 7560 
gctccacagg tcacgggcca ttagcatttg 7620 
acctatggcc attttttctg gggtgatgca 7680 
gtcccatcca aggttcgcgg ctaggtctcg 7740 
gaacttcatg accagcatga agggcacgag 7800 
ggtctctaca tcgtaggtga caaagagacg 7860 
gaactggatc tcccgccacc aattggagga 7920 
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gtggctattg atgtggtgaa agtagaagtc 
tttgtaaaaa cgtgcgcagt actggcagcg 
gacctgacga ccgcgcacaa ggaagcagag 
tggctggtgg tcttctactt cggctgcttg 
tacggtggat cggaccacca cgccgcgcga 
tcggagcttg atgacaacat cgcgcagatg 
cgtcaggtca ggcgggagct cctgcaggtt 
tagatccagg tgatacctaa tttccagggg 
gaggccgcat ccccgcggcg cgactacggt 
gtccttggat gatgcatcta aaagcggtga 
tccggacccg ccgggagagg gggcaggggc 
tgctgcgcgc gtaggttgct ggcgaacgcg 
cgcctctgcg tgaagacgac gggcccggtg 
tcaatttcgg tgtcgttgac ggcggcctgg 
tcttgatagg cgatctcggc catgaactgc 
ccggctcgct ccacggtggc ggcgaggtcg 
gcgttgaggc ctccctcgtt ccagacgcgg 
gcgcgcatga ccacctgcgc gagattgagc 
cgcaggcgct gaaagaggta gttgagggtg 
ataacccagc gtcgcaacgt ggattcgttg 
gcctcgtaga agtccacggc gaagttgaaa 
tcctcctcca gaagacggat gagctcggcg 
acaggggcct cttcttcttc ttcaatctcc 
tctggcggcg gtgggggagg ggggacacgg 
acaaagcgct cgatcatctc cccgcggcga 
ttctcgcggg ggcgcagttg gaagacgccg 
gggctgccat gcggcaggga tacggcgcta 
actccgccgc cgagggacct gagcgag.tcc 
aaggcgtcta accagtcaca gtcgcaaggt 
cggcggtcgg ggttgtttct ggcggaggtg 
ttgagacggc ggatggtcga cagaagcacc 
aggcggtcgg ccatgcccca ggcttcgttt 
tgcatgagcc tttctaccgg cacttcttct 
tctatcgctg cggcggcggc ggagtttggc 
gtgaccccga agcccctcat cggctgaagc 
aatatggcct gctgcacctg cgtgagggta 
tggtatgcgc ccgtgttgat ggtgtaagtg 
tggtgacccg gctgcgagag ctcggtgtac 
acgtagtcgt tgcaagtccg caccaggtac 
tggcggtaga ggggccagcg tagggtggcc 
aggcgatgat atccgtagat gtacctggac 
gcgcgcggaa agtcgcggac gcggttccag 
gtcgggacgc tctggccggt caggcgcgcg 
gagcctgtaa gcgggcactc ttccgtggtc 
ggacgaccgg ggttcgagcc ccgtatccgg 
cgcgtgtcga acccaggtgt gcgacgtcag 
tccaggcgcg gcggctgctg cgctagcttt 
gttaggctgg aaagcgaaag cattaagtgg 
caagggttga gtcgcgggac ccccggttcg 
ggggtttgcc tccccgtcat gcaagacccc 
gccccttttt tgcttttccc agatgcatcc 
gcagcggcaa gagcaagagc agcggcagac 
gtcaggaggg gcgacatccg cggttgacgc 
gcgccgggcc cggcactacc tggacttgga 
gccctctcct gagcggtacc caagggtgca 
gccgcggcag aacctgtttc gcgaccgcga 
aaagttccac gcagggcgcg agctgcggca 
ggaggacttt gagcccgacg cgcgaaccgg 
cgccgacctg gtaaccgcat acgagcagac 
ctttaacaac cacgtgcgta cgcttgtggc 
tctgtgggac tttgtaagcg cgctggagca 
gctgttcctt atagtgcagc acagcaggga 
catagtagag cccgagggcc gctggctgct 
ggtgcaggag cgcagcttga gcctggctga 
tagcctgggc aagttttacg cccgcaagat 
ggaggtaaag atcgaggggt tctacatgcg 



cctgcgacgg gccgaacact cgtgctggct 7980 
gtgcacgggc tgtacatcct gcacgaggtt 8040 
tgggaatttg agcccctcgc ctggcgggtt 8100 
tccttgaccg tctggctgct cgaggggagt 8160 
gcccaaagtc cagatgtccg cgcgcggcgg 8220 
ggagctgtcc atggtctgga gctcccgcgg 8280 
tacctcgcat agacgggtca gggcgcgggc 8340 
ctggttggtg gcggcgtcga tggcttgcaa 8400 
accgcgcggc gggcggtggg ccgcgggggt 84 60 
cgcgggcgag cccccggagg tagggggggc 8520 
acgtcggcgc cgcgcgcggg caggagctgg 8580 
acgacgcggc ggttgatctc ctgaatctgg 8640 
agcttgagcc tgaaagagag ttcgacagaa 8700 
cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcgatctctt cctcctggag atctccgcgt 8820 
ttggaaatgc gggccatgag ctgcgagaag 8880 
ctgtagacca cgcccccttc ggcatcgcgg 8940 
tccacgtgcc gggcgaagac ggcgtagttt 9000 
gtggcggtgt gttctgccac gaagaagtac 9060 
atatccccca aggcctcaag gcgctccatg 9120 
aactgggagt tgcgcgccga cacggttaac 9180 
acagtgtcgc gcacctcgcg ctcaaaggct 9240 
tcttccataa gggcctcccc ttcttcttct 9300 
cggcgacgac ggcgcaccgg gaggcggtcg 9360 
cggcgcatgg tctcggtgac ggcgcggccg 9420 
cccgtcatgt cccggttatg ggttggcggg 9480 
acgatgcatc tcaacaattg ttgtgtaggt 9540 
gcatcgaccg gatcggaaaa cctctcgaga 9600 
aggctgagca ccgtggcggg cggcagcggg 9660 
ctgctgatga tgtaattaaa gtaggcggtc 9720 
atgtccttgg gtccggcctg ctgaatgcgc 9780 
tgacatcggc gcaggtcttt gtagtagtct 9840 
tctccttcct cttgtcctgc atctcttgca 9900 
cgtaggtggc gccctcttcc tcccatgcgt 9960 
agggctaggt cggcgacaac gcgctcggct 10020 
gactggaagt catccatgtc cacaaagcgg 10080 
cagttggcca taacggacca gttaacggtc 10140 
ctgagacgcg agtaagccct cgagtcaaat 10200 
tggtatccca ccaaaaagtg cggcggcggc 102 60 
ggggctccgg gggcgagatc ttccaacata 10320 
atccaggtga tgccggcggc ggtggtggag 10380 
atgttgcgca gcggcaaaaa gtgctccatg 10440 
caatcgttga cgctctagcg tgcaaaagga 10500 
tggtggataa attcgcaagg gtatcatggc 10560 
ccgtccgccg tgatccatgc ggttaccgcc 10620 
acaacggggg agtgctcctt ttggcttcct 10680 
tttggccact ggccgcgcgc agcgtaagcg 10740 
ctcgctccct gtagccggag ggttattttc 10800 
agtctcggac cggccggact gcggcgaacg 10660 
gcttgcaaat tcctccggaa acagggacga 10920 
ggtgctgcgg cagatgcgcc cccctcctca 10980 
atgcagggca ccctcccctc ctcctaccgc 11040 
ggcagcagat ggtgattacg aacccccgcg 11100 
ggagggcgag ggcctggcgc ggctaggagc 11160 
gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gggagaggag cccgaggaga tgcgggatcg 11280 
tggcctgaat cgcgagcggt tgctgcgcga 11340 
gattagtccc gcgcgcgcac acgtggcggc 11400 
ggtgaaccag gagattaact ttcaaaaaag 11460 
gcgcgaggag gtggctatag gactgatgca 11520 
aaacccaaat agcaagccgc tcatggcgca 11580 
caacgaggca ttcagggatg cgctgctaaa 11640 
cgatttgata aacatcctgc agagcatagt 11700 
caaggtggcc gccatcaact attccatgct 11760 
ataccatacc ccttacgttc ccatagacaa 11820 
catggcgctg aaggtgctta ccttgagcga 11880 
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cgacctgggc 

cgagctcagc 

cggcgataga 

ccgacgcgcc 

tggcaacgtc 

cgagtactaa 

cgggcggccc 

atggaccgca 

gccaaccggc 

gagaaggtgc 

gccggcctgg 

cagaccaacc 

gcgcagcagc 

cccgccaacg 

atggtgactg 

accagtagac 

ctgtgggggg 

aactcgcgcc 

gacacatacc 

gacgagcata 

ggcagcctgg 

ttgcacagtt 

cttaacctga 

atggaaccgg 

catcgcgcgg 

ctaccgcccc 

ctctgggacg 

caacagcgcg 

ttgtccgatc 

atagggtctc 

ctaaacaact 

aacgggatag 

agggacgtgc 

ctggtgtggg 

ggcaacccgt 

atgatgcaaa 

cccttagtat 

tggtgagcgc 

cgccgtttgt 

ctgagttggc 

atgtggcatc 

acaatgacta 

actggggcgg 

tgtttaccaa 

aggtggagct 

ccatgaccat 

agaacggggt 

ggtttgaccc 

cagacatcat 

tgttgggcat 

tggagggtgg 

atgacaccga 

aagagaactc 

ccattcgcgg 

cggccgaagc 

tgatcaaacc 

gcaccttcac 

gaatccgctc 

actggtcgtt 

gcaactttcc 

accaggccgt 

gctttcccga 

aaaacgttcc 

tccagcgagt 

tgggcatagt 

ttatatcgcc 



gtttatcgca 
gaccgcgagc 
gaggccgagt 
ctggaggcag 
ggcggcgtgg 
gcggtgatgt 
tgcagagcca 
tcatgtcgct 
tctccgcaat 
tggcgatcgt 
tctacgacgc 
tggaccggct 
agggcaacct 
tgccgcgggg 
agacaccgca 
aaggcctgca 
tgcgggctcc 
tgttgctgct 
taggtcactt 
ctttccagga 
aggcaaccct 
taaacagcga 
tgcgcgacgg 
gcatgtatgc 
ccgccgtgaa 
ctggtttcta 
acatagacga 
agcaggcaga 
taggcgctgc 
ttaccagcac 
cgctgctgca 
agagcctagt 
caggcccgcg 
aggacgatga 
ttgcgcacct 
ataaaaaact 
gcggcgcgcg 
ggcgccagtg 
gcctccgcgg 
acccctattc 
cctgaactac 
cagcccgggg 
cgacctgaaa 
taagtttaag 
gaaatacgag 
agaccttatg 
tctggaaagc 
cgtcactggt 
tttgctgcca 
ccgcaagcgg 
taacattccc 
acagggcggg 
caacgcggca 
cgacaccttt 
tgccgccccc 
cctgacagag 
ccagtaccgc 
atggaccctg 
gccagacatg 
ggtggtgggc 
ctactcccaa 
gaaccagatt 
tgctctcaca 
gaccattact 
ctcgccgcgc 
cagcaataac 



acgagcgcat 
tgatgcacag 
cctactttga 
ctggggccgg 
aggaatatga 
ttctgatcag 
gccgtccggc 
gactgcgcgc 
tctggaagcg 
aaacgcgctg 
gctgcttcag 
ggcgggggat 
gggctccatg 
acaggaggac 
aagtgaggtg 
gaccgtaaac 
cacaggcgac 
gctaatagcg 
gctgacactg 
gattacaagt 
aaactacctg 
ggaggagcgc 
ggtaacgccc 
ctcaaaccgg 
ccccgagtat 
caccggggga 
cagcgtgttt 
ggcggcgctg 
ggccccgcgg 
tcgcaccacc 
gccgcagcgc 
ggacaagatg 
cccgcccacc 
ctcggcagac 
tcgccccagg 
caccaaggcc 
gcgatgtatg 
gcggcggcgc 
tacctgcggc 
gacaccaccc 
cagaacgacc 
gaggcaagca 
accatcctgc 
gcgcgggtga 
tgggtggagt 
aacaacgcga 
gacatcgggg 
cttgtcatgc 
ggatgcgggg 
caacccttcc 
gcactgttgg 
ggtggcgcag 
gccgcggcaa 
gccacacggg 
gctgcgcaac 
gacagcaaga 
agctggtacc 
ctttgcactc 
atgcaagacc 
gccgagctgt 
ctcatccgcc 
ttggcgcgcc 
gatcacggga 
gacgccagac 
gtcctatcga 
acaggctggg 



ccacaaggcc 
cctgcaaagg 
cgcgggcgct 
acctgggctg 
cgaggacgat 
atgatgcaag 
cttaactcca 
aatcctgacg 
gtggtcccgg 
gccgaaaaca 
cgcgtggctc 
gtgcgcgagg 
gttgcactaa 
tacaccaact 
taccagtctg 
ctgagccagg 
cgcgcgaccg 
cccttcacgg 
raccgcgagg 
gtcagccgcg 
ctgaccaacc 
attttgcgct 
agcgtggcgc 
ccgtttatca 
ttcaccaatg 
ttcgaggtgc 
tccccgcaac 
cgaaaggaaa 
tcagatgcta 
cgcccgcgcc 
gaaaaaaacc 
agtagatgga 
cgtcgtcaaa 
gacagcagcg 
ctggggagaa 
atggcaccga 
aggaaggtcc 
tgggttctcc 
ctaccggggg 
gtgtgtacct 
acagcaactt 
cacagaccat 
ataccaacat 
tggtgtcgcg 
tcacgctgcc 
tcgtggagca 
taaagtttga 
ctggggtata 
tggacttcac 
aggagggctt 
atgtggacgc 
gcggcagcaa 
tgcagccggt 
ctgaggagaa 
ccgaggtcga 
aacgcagtta 
ttgcatacaa 
ctgacgtaac 
ccgtgacctt 
tgcccgtgca 
agtttacctc 
cgccagcccc 
cgctaccgct 
gccgcacctg 
gccgcacttt 
gcctgcgctt 



gtgagcgtga 
gccctggctg 
gacctgcgct 
gcggtggcac 
gagtacgagc 
acgcaacgga 
cggacgactg 
cgttccggca 
cgcgcgcaaa 
gggccatccg 
gttacaacag 
ccgtggcgca 
acgccttcct 
ttgtgagcgc 
ggccagacta 
ctttcaaaaa 
tgtctagctt 
acagtggcag 
ccataggtca 
cgctggggca 
ggcggcagaa 
acgtgcagca 
tggacatgac 
accgcctaat 
ccatcttgaa 
ccgagggtaa 
cgcagaccct 
gcttccgcag 
gtagcccatt 
tgctgggcga 
tgcctccggc 
agacgtacgc 
ggcacgaccg 
tcctggattt 
tgttttaaaa 
gcgttggttt 
tcctccctcc 
cttcgatgct 
gagaaacagc 
ggtggacaac 
tctgaccacg 
caatcttgac 
gccaaatgtg 
cttgcctact 
cgagggcaac 
ctacttgaaa 
cacccgcaac 
tacaaacgaa 
ccacagccgc 
taggatcacc 
ctaccaggcg 
cagcagtggc 
ggaggacatg 
gcgcgctgag 
gaagcctcag 
caacctaata 
ctacggcgac 
ctgcggctcg 
ccgctccacg 
ctccaagagc 
tctgacccac 
caccatcacc 
gcgcaacagc 
cccctacgtt 
ttgagcaagc 
cccaagcaag 



gccggcggcg 
gcacgggcag 
gggccccaag 
ccgcgcgcgc 
cagaggacgg 
cccggcggtc 
gcgccaggtc 
gcagccgcag 
ccccacgcac 
gcccgacgag 
cggcaacgtg 
gcgtgagcgc 
gagtacacag 
actgcggcta 
ttttttccag 
cttgcagggg 
gctgacgccc 
cgtgtcccgg 
ggcgcatgtg 
ggaggacacg 
gatcccctcg 
gagcgtgagc 
cgcgcgcaac 
ggactacttg 
cccgcactgg 
cgatggattc 
gctagagttg 
gccaagcagc 
tccaagcttg 
ggaggagtac 
atttcccaac 
gcaggagcac 
tcagcggggt 
gggagggagt 
aaaaaaaagc 
tcttgtattc 
tacgagagtg 
cccctggacc 
atccgttact 
aagtcaacgg 
gtcattcaaa 
gaccggtcgc 
aacgagttca 
aaggacaatc 
tactccgaga 
gtgggcagac 
ttcagactgg 
gccttccatc 
ctgagcaact 
tacgatgatc 
agcttgaaag 
agcggcgcgg 
aacgatcatg 
gccgaagcag 
aagaaaccgg 
agcaatgaca 
cctcagaccg 
gagcaggtct 
cgccagatca 
ttctacaacg 
gtgttcaatc 
accgtcagtg 
atcggaggag 
tacaaggccc 
atgtccatcc 
atgtttggcg 
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gggccaagaa 

ggggcgcgca 

tggtggagga 

ccattcagac 

gcgtagcacg 

tgcttaaccg 

ccgcgggtat 

cggccattag 

ttagcggcct 

actacttaga 

ccaagcgcaa 

cgaagaagga 

aagatgatga 

gacgggtaca 

tctttacgcc 

gcgacgagga 

ggcataagga 

ccgtaacact 

agcgcgagtc 

tggaagatgt 

ggccaatcaa 

ctaccagtag 

ttgcctcagc 

ctacggaggt 

gttcgaggaa 

ttgcgcctac 

gacgccgaac 

cgatttccgt 

gctaccaccc 

cctgccgcct 

tggccggcca 

cgcaccgtcg 

ttggcgccgt 

caagttgcat 

taactatttt 

cgcccgttca 

agctggggct 

agcaaggcct 

ttccaacaaa 

aaccaggcag 

cctccaccgg 

gacagggaag 

aagcaaggcc 

cacacacccg 

ccaggcccga 

agcggtccgc 

atcgtgggtc 

tcgtatgtgt 

gcccgctttc 

cgggccagga 

agacgtactt 

tgaccacaga 

ctgcgtactc 

tggcttccac 

actctggcac 

atgaagctgc 

aagtagacga 

gtataaatat 

ccgataaaac 

ttaatcatgc 

catatgcaaa 

gaaagctaga 

atggtgataa 

cagacactca 

gccaacaatc 

taatgtatta 



gcgctccgac 
caaacgcggc 
ggcgcgcaac 
cgtggtgcgc 
tcgccaccgc 
cgcacgtcgc 
tgtcactgtg 
tgctatgact 
gcgcgtgccc 
ctcgtactgt 
aatcaaagaa 
agagcaggat 
tgatgaactt 
gtggaaaggt 
cggtgagcgc 
cctgcttgag 
catgctggcg 
gcagcaggtg 
tggtgacttg 
cttggaaaaa 
gcaggtggcg 
caccagtatt 
ggtggcggat 
gcaaacggac 
gtacggcgcc 
ccccggctat 
caccactgga 
gcgcagggtg 
cagcatcgtt 
ccgtttcccg 
cggcctgacg 
catgcgcggc 
gcccggaatt 
gtggaaaaat 
gtagaatgga 
tgggaaactg 
cgctgtggag 
ggaacagcag 
aggtggtaga 
tgcaaaataa 
ccgtggagac 
aaactctggt 
tgcccaccac 
taacgctgga 
ccgccgttgt 
gatcgttgcg 
tgggggtgca 
gtcatgtatg 
caagatggct 
cgcctcggag 
cagcctgaat 
ccggtcccag 
gtacaaggcg 
gtactttgac 
tgcctacaac 
tactgctctt 
gcaagctgag 
tacaaaggag 
atttcaacct 
agctgggaga 
acccacaaat 
aagtcaagtg 
cttgactcct 
tatttcttac 
tatgcccaac 
caacagcacg 



caacacccag 
cgcactgggc 
tacacgccca 
ggagcccggc 
cgccgacccg 
accggccgac 
ccccccaggt 
cagggtcgca 
gtgcgcaccc 
tgtatgtatc 
gagatgctcc 
tacaagcccc 
gacgacgagg 
cgacgcgtaa 
tccacccgca 
caggccaacg 
ttgccgctgg 
ctgcccgcgc 
gcacccaccg 
atgaccgtgg 
ccgggactgg 
gccaccgcca 
gccgcggtgc 
ccgtggatgt 
gccagcgcgc 
cgtggctaca 
acccgccgcc 
gctcgcgaag 
taaaagccgg 
gtgccgggat 
ggcggcatgc 
ggtatcctgc 
gcatccgtgg 
caaaataaaa 
agacatcaac 
gcaagatatc 
cggcattaaa 
cacaggccag 
tggcctggcc 
gattaacagt 
agtgtctcca 
gacgcaaata 
ccgtcccatc 
cctgcctccc 
tgtaacccgt 
gcccgtagcc 
atccctgaag 
cgtccatgtc 
accccttcga 
tacctgagcc 
aacaagttta 
cgtttgacgc 
cggttcaccc 
atccgcggcg 
gccctggctc 
gaaataaacc 
cagcaaaaaa 
ggtattcaaa 
gaacctcaaa 
gtccttaaaa 
gaaaatggag 
gaaatgcaat 
aaagtggtat 
atgcccacta 
aggcctaatt 
ggtaatatgg 



tgcgcgtgcg 
gcaccaccgt 
cgccgccacc 
gctatgctaa 
gcactgccgc 
gggcggccat 
ccaggcgacg 
ggggcaacgt 
gccccccgcg 
cagcggcggc 
aggtcatcgc 
gaaagctaaa 
tggaactgct 
aacgtgtttt 
cctacaagcg 
agcgcctcgg 
acgagggcaa 
ttgcaccgtc 
tgcagctgat 
aacctgggct 
gcgtgcagac 
cagagggcat 
aggcggtcgc 
ttcgcgtttc 
tactgcccga 
cctaccgccc 
gccgtcgccg 
gaggcaggac 
tctttgtggt 
tccgaggaag 
gtcgtgcgca 
ccctccttat 
ccttgcaggc 
agtctggact 
tttgcgtctc 
ggcaccagca 
aatttcggtt 
atgctgaggg 
tctggcatta 
aagcttgatc 
gaggggcgtg 
gacgagcctc 
gcgcccatgg 
cccgccgaca 
cctagccgcg 
agtggcaact 
cgccgacgat 
gccgccagag 
tgatgccgca 
ccgggctggt 
gaaaccccac 
tgcggttcat 
tagctgtggg 
tgctggacag 
ccaagggtgc 
tagaagaaga 
ctcacgtatt 
taggtgtcga 
taggagaatc 
agactacccc 
ggcaaggcat 
ttttctcaac 
tgtacagtga 
ttaaggaagg 
acattgcttt 
gtgttctggc 



cgggcactac 
cgatgacgcc 
agtgtccaca 
aatgaagaga 
ccaacgcgcg 
gcgggccgct 
agcggccgcc 
gtattgggtg 
caactagatt 
ggcgcgcaac 
gccggagatc 
gcgggtcaaa 
gcacgctacc 
gcgacccggc 
cgtgtatgat 
ggagtttgcc 
cccaacacct 
cgaagaaaag 
ggtacccaag 
ggagcccgag 
cgtggacgtt 
ggagacacaa 
tgcggccgcg 
agccccccgg 
atatgcccta 
cagaagacga 
tcgccagccc 
cctggtgctg 
tcttgcagat 
aatgcaccgt 
ccaccggcgg 
tccactgatc 
gcagagacac 
ctcacgctcg 
tggccccgcg 
atatgagcgg 
ccaccgttaa 
ataagttgaa 
gcggggtggt 
cccgccctcc 
gcgaaaagcg 
cctcgtacga 
ctaccggagt 
cccagcagaa 
cgtccctgcg 
ggcaaagcac 
gcttctgaat 
gagctgctga 
gtggtcttac 
gcagtttgcc 
ggtggcgcct 
ccctgtggac 
tgataaccgt 
gggccctact 
cccaaatcct 
ggacgatgac 
tgggcaggcg 
aggtcaaaca 
tcagtggtac 
aatgaaacca 
tcttgtaaag 
tactgaggcg 
agatgtagat 
taactcacga 
tagggacaat 
gggccaagca 



cgcgcgccct 
atcgacgcgg 
gtggacgcgg 
cggcggaggc 
gcggcggccc 
cgaaggctgg 
gcagcagccg 
cgcgactcgg 
gcaagaaaaa 
gaagctatgt 
tatggccccc 
aagaaaaaga 
gcgcccaggc 
accaccgtag 
gaggtgtacg 
tacggaaagc 
agcctaaagc 
cgcggcctaa 
cgccagcgac 
gtccgcgtgc 
cagataccca 
acgtccccgg 
tccaagacct 
cgcccgcgcg 
catccttcca 
gcaactaccc 
gtgctggccc 
ccaacagcgc 
atggccctca 
aggaggggca 
cggcgcgcgt 
gccgcggcga 
tgattaaaaa 
cttggtcctg 
acacggctcg 
tggcgccttc 
gaactatggc 
agagcaaaat 
ggacctggcc 
cgtagaggag 
tccgcgcccc 
ggaggcacta 
gctgggccag 
acctgtgctg 
ccgcgccgcc 
actgaacagc 
agctaacgtg 
gccgccgcgc 
atgcacatct 
cgcgccaccg 
acgcacgacg 
cgtgaggata 
gtgctggaca 
tttaagccct 
tgcgaatggg 
aacgaagacg 
ccttattctg 
cctaaatatg 
gaaactgaaa 
tgttacggtt 
caacaaaatg 
accgcaggca 
atagaaaccc 
gaactaatgg 
tttattggtc 
tcgcagttga 
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atgctgttgt agatttgcaa gacagaaaca 
ccattggtga tagaaccagg tacttttcta 
cagatgttag aattattgaa aatcatggaa 
cactgggagg tgtgattaat acagagactc 
aaaatggatg ggaaaaagat gctacagaat 
ataattttgc catggaaatc aatctaaatg 
acatagcgct gtatttgccc gacaagctaa 
ataacccaaa cacctacgac tacatgaaca 
gctacattaa ccttggagca cgctggtccc 
accaccaccg caatgctggc ctgcgctacc 
tgcccttcca catccaggtg cctcagaagt 
cgggctcata cacctacgag tggaacttca 
ccctaggaaa tgacctaagg gttgacggag 
acgccacctt cttccccatg gcccacaaca 
acgacaccaa cgaccagtcc tttaacgact 
tacccgccaa cgctaccaac gtgcccatat 
gcggctgggc cttcacgcgc cttaagacta 
acccttatta cacctactct ggctctatac 
acacctttaa gaaggtggcc attacctttg 
gcctgcttac ccccaacgag tttgaaatta 
ttgcccagtg taacatgacc aaagactggt 
ttggctacca gggcttctat atcccagaga 
gaaacttcca gcccatgagc cgtcaggtgg 
aggtgggcat cctacaccaa cacaacaact 
tgcgcgaagg acaggcctac cctgctaact 
ttgacagcat tacccagaaa aagtttcttt 
ccagtaactt tatgtccatg ggcgcactca 
actccgccca cgcgctagac atgacttttg 
tttatgtttt gtttgaagtc tttgacgtgg 
tcgaaaccgt gtacctgcgc acgcccttct 
agcaacatca acaacagctg ccgccatggg 
caaagatctt ggttgtgggc catatttttt 
tgtttctcca cacaagctcg cctgcgccat 
cgtacactgg atggcctttg cctggaaccc 
ctttggcttt tctgaccagc gactcaagca 
gcgccgtagc gccattgctt cttcccccga 
aagcgtacag gggcccaact cggccgcctg 
ctttgccaac tggccccaaa ctcccatgga 
ggtacccaac tccatgctca acagtcccca 
acagctctac agcttcctgg agcgccactc 
taggagcgcc acttcttttt gtcacttgaa 
tttcaataaa ggcaaatgct tttatttgta 
tgccgtctgc gccgtttaaa aatcaaaggg 
cagggacacg ttgcgatact ggtgtttagt 
cggcagctcg gtgaagtttt cactccacag 
gtcgggcgcc gatatcttga agtcgcagtt 
atacacaggg ttgcagcact ggaacactat 
gctcttgtcg gagatcagat ccgcgtccag 
caactttggt agctgccttc ccaaaaaggg 
ccgtagtggc atcaaaaggt gaccgtgccc 
aaaagccttg atctgcttaa aagccacctg 
gcaagacttg ccggaaaact gattggccgg 
gtcggtgttg gagatctgca ccacatttcg 
gctagactgc tccttcagcg cgcgctgccc 
gtgctcctta tttatcataa tgcttccgtg 
gcagcggtgc agccacaacg cgcagcccgt 
aaacgactgc aggtacgcct gcaggaatcg 
ggtgaaggtc agctgcaacc cgcggtgctc 
cagagcttcc acttggtcag gcagtagttt 
gtacttgtcc atcagcgcgc gcgcagcctc 
cacactcagc gggttcatca ccgtaatttc 
ctcttgcgtc cgcataccac gcgccactgg 
cttacctcct ttgccatgct tgattagcac 
cgccacatct tctctttctt cctcgctgtc 
gggcttggga gaagggcgct tctttttctt 
ggtcgatggc cgcgggctgg gtgtgcgcgg 



cagagctttc ataccagctt ttgcttgatt 19860 
tgtggaatca ggctgttgac agctatgatc 19920 
ctgaagatga acttccaaat tactgctttc 19980 
ttaccaaggt aaaacctaaa acaggtcagg 20040 
tttcagataa aaatgaaata agagttggaa 20100 
ccaacctgtg gagaaatttc ctgtactcca 20160 
agtacagtcc ttccaacgta aaaatttctg 20220 
agcgagtggt ggctcccggg ttagtggact 20280 
ttgactatat ggacaacgtc aacccattta 20340 
gctcaatgtt gctgggcaat ggtcgctatg 20400 
tctttgccat taaaaacctc cttctcctgc 204 60 
ggaaggatgt taacatggtt ctgcagagct 20520 
ccagcattaa gtttgatagc atttgccttt 20580 
ccgcctccac gcttgaggcc atgcttagaa 20640 
atctctccgc cgccaacatg ctctacccta 20700 
ccatcccctc ccgcaactgg gcggctttcc 207 60 
aggaaacccc atcactgggc tcgggctacg 20820 
cctacccaga tggaaccttt tacctcaacc 20880 
actcttctgt cagctggcct ggcaatgacc 20940 
agcgctcagt tgacggggag ggttacaacg 21000 
tcctggtaca aatgctagct aactacaaca 21060 
gctacaagga ccgcatgtac tccttcttta 21120 
tggatgatac taaatacaag gactaccaac 21180 
ctggatttgt tggctacctt gcccccacca 21240 
tcccctatcc gcttataggc aagaccgcag 21300 
gcgatcgcac cctttggcgc atcccattct 21360 
cagacctggg ccaaaacctt ctctacgcca 21420 
aggtggatcc catggacgag cccacccttc 21480 
tccgtgtgca ccggccgcac cgcggcgtca 21540 
cggccggcaa cgccacaaca taaagaagca 21600 
ctccagtgag caggaactga aagccattgt 21660 
gggcacctat gacaagcgct ttccaggctt 21720 
agtcaatacg gccggtcgcg agactggggg 21780 
gcactcaaaa acatgctacc tctttgagcc 21840 
ggtttaccag tttgagtacg agtcactcct 21900 
ccgctgtata acgctggaaa agtccaccca 21960 
tggactattc tgctgcatgt ttctccacgc 22020 
tcacaacccc accatgaacc ttattaccgg 22080 
ggtacagccc accctgcgtc gcaaccagga 22140 
gccctacttc cgcagccaca gtgcgcagat 22200 
aaacatgtaa aaataatgta ctagagacac 22260 
cactctcggg tgattattta cccccaccct 22320 
gttctgccgc gcatcgctat gcgccactgg 22380 
gctccactta aactcaggca caaccatccg 22440 
gctgcgcacc atcaccaacg cgtttagcag 22500 
ggggcctccg ccctgcgcgc gcgagttgcg 22560 
cagcgccggg tggtgcacgc tggccagcac 22620 
gtcctccgcg ttgctcaggg cgaacggagt 22680 
cgcgtgccca ggctttgagt tgcactcgca 22740 
ggtctgggcg ttaggataca gcgcctgcat 22800 
agcctttgcg ccttcagaga agaacatgcc 22860 
acaggccgcg tcgtgcacgc agcaccttgc 22920 
gccccaccgg ttcttcacga tcttggcctt 22980 
gttttcgctc gtcacatcca tttcaatcac 23040 
tagacactta agctcgcctt cgatctcagc 23100 
gggctcgtga tgcttgtagg tcacctctgc 23160 
ccccatcatc gtcacaaagg tcttgttgct 23220 
ctcgttcagc caggtcttgc atacggccgc 23280 
gaagttcgcc tttagatcgt tatccacgtg 23340 
catgcccttc tcccacgcag acacgatcgg 23400 
actttccgct tcgctgggct cttcctcttc 23460 
gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cggtgggttg ctgaaaccca ccatttgtag 23580 
cacgattacc tctggtgatg gcgggcgctc 23640 
cttgggcgca atggccaaat ccgccgccga 23700 
caccagcgcg tcttgtgatg agtcttcctc 237 60 
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gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 
cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 
gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 
gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24000 
cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 
cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 
cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 24180 
cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 
cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24300 
cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 24360 
accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24420 
cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 
ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 
gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24 600 
gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24 660 
tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 
aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24780 
catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 
aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24 900 
ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24 960 
agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 
gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 
caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 2514 0 
ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 
ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 
gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 
ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 
ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 25440 
gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 
tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 
cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 
ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 25740 
gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 
gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 
gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 
gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 
aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 26040 
gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 
agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 
aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 26220 
aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 26280 
cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 26340 
cgccgccggc actgcccgtt cgccgaccca accgtagatg ggacaccact ggaaccaggg 264 00 
ccggtaagtc caagcagccg ccgccgttag cccaagagca acaacagcgc caaggctacc 264 60 
gctcatggcg cgggcacaag aacgccatag ttgcttgctt gcaagactgt gggggcaaca 26520 
tctccttcgc ccgccgcttt cttctctacc atcacggcgt ggccttcccc cgtaacatcc 26580 
tgcattacta ccgtcatctc tacagcccat actgcaccgg cggcagcggc agcggcagca 26640 
acagcagcgg ccacacagaa gcaaaggcga ccggatagca agactctgac aaagcccaag 26700 
aaatccacag cggcggcagc agcaggagga ggagcgctgc gtctggcgcc caacgaaccc 26760 
gtatcgaccc gcgagcttag aaacaggatt tttcccactc tgtatgctat atttcaacag 26820 
agcaggggcc aagaacaaga gctgaaaata aaaaacaggt ctctgcgatc cctcacccgc 26880 
agctgcctgt atcacaaaag cgaagatcag cttcggcgca cgctggaaga cgcggaggct 26940 
ctcttcagta aatactgcgc gctgactctt aaggactagt ttcgcgccct ttctcaaatt 27000 
taagcgcgaa aactacgtca tctccagcgg ccacacccgg cgccagcacc tgtcgtcagc 27060 
gccattatga gcaaggaaat tcccacgccc tacatgtgga gttaccagcc acaaatggga 27120 
cttgcggctg gagctgccca agactactca acccgaataa actacatgag cgcgggaccc 27180 
cacatgatat cccgggtcaa cggaatccgc gcccaccgaa accgaattct cttggaacag 2724 0 
gcggctatta ccaccacacc tcgtaataac cttaatcccc gtagttggcc cgctgccctg 27300 
gtgtaccagg aaagtcccgc tcccaccact gtggtacttc ccagagacgc ccaggccgaa 27360 
gttcagatga ctaactcagg ggcgcagctt gcgggcggct ttcgtcacag ggtgcggtcg 27420 
cccgggcagg gtataactca cctgacaatc agagggcgag gtattcagct caacgacgag 27480 
tcggtgagct cctcgcttgg tctccgtccg gacgggacat ttcagatcgg cggcgccggc 27540 
cgtccttcat tcacgcctcg tcaggcaatc ctaactctgc agacctcgtc ctctgagccg 27 600 
cgctctggag gcattggaac tctgcaattt attgaggagt ttgtgccatc ggtctacttt 27660 
aaccccttct cgggacctcc cggccactat ccggatcaat ttattcctaa ctttgacgcg 27720 
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gtaaaggact cggcggacgg ctacgactga atgttaagtg gagaggcaga gcaactgcgc 27780 
ctgaaacacc tggtccactg tcgccgccac aagtgctttg cccgcgactc cggtgagttt 27840 
tgctactttg aattgcccga ggatcatatc gagggcccgg cgcacggcgt ccggcttacc 27900 
gcccagggag agcttgcccg tagcctgatt cgggagttta cccagcgccc cctgctagtt 27 960 
gagcgggaca ggggaccctg tgttctcact gtgatttgca actgtcctaa ccttggatta 28020 
catcaagatc tttgttgcca tctctgtgct gagtataata aatacagaaa ttaaaatata 28080 
ctggggctcc tatcgccatc ctgtaaacgc caccgtcttc acccgcccaa gcaaaccaag 2814 0 
gcgaacctta cctggtactt ttaacatctc tccctctgtg atttacaaca gtttcaaccc 28200 
agacggagtg agtctacgag agaacctctc cgagctcagc tactccatca gaaaaaacac 28260 
caccctcctt acctgccggg aacgtacgag tgcgtcaccg gccgctgcac cacacctacc 28320 
gcctgaccgt aaaccagact ttttccggac agacctcaat aactctgttt accagaacag 28380 
gaggtgagct tagaaaaccc ttagggtatt aggccaaagg cgcagctact gtggggttta 28440 
tgaacaattc aagcaactct acgggctatt ctaattcagg tttctctaga agtcaggctt 28500 
cctggatgtc agcatctgac tttggccagc acctgtcccg cggatttgtt ccagtccaac 28560 
tacagcgacc caccctaaca gagatgacca acacaaccaa cgcggccgcc gctaccggac 28620 
ttacatctac cacaaataca ccccaagttt ctgcctttgt caataactgg gataacttgg 28680 
gcatgtggtg gttctccata gcgcttatgt ttgtatgcct tattattatg tggctcatct 28740 
gctgcctaaa gcgcaaacgc gcccgaccac ccatctatag tcccatcatt gtgctacacc 28800 
caaacaatga tggaatccat agattggacg gactgaaaca catgttcttt tctcttacag 28860 
tatgattaaa tgagatctag aaatggacgg aattattaca gagcagcgcc tgctagaaag 28920 
acgcagggca gcggccgagc aacagcgcat gaatcaagag ctccaagaca tggttaactt 28980 
gcaccagtgc aaaaggggta tcttttgtct ggtaaagcag gccaaagtca cctacgacag 29040 
taataccacc ggacaccgcc tcagctacaa gttgccaacc aagcgtcaga aattggtggt 29100 
catggtggga gaaaagccca ttaccataac tcagcactcg gtagaaaccg aaggctgcat 29160 
tcactcacct tgtcaaggac ctgaggatct ctgcaccctt attaagaccc tgtgcggtct 29220 
caaagatctt attcccttta actaataaaa aaaaataata aagcatcact tacttaaaat 29280 
cagttagcaa atttctgtcc agtttattca gcagcacctc cttgccctcc tcccagctct 29340 
ggtattgcag cttcctcctg gctgcaaact ttctccacaa tctaaatgga atgtcagttt 29400 
cctcctgttc ctgtccatcc gcacccacta tcttcatgtt gttgcagatg aagcgcgcaa 294 60 
gaccgtctga agataccttc aaccccgtgt atccatatga cacggaaacc ggtcctccaa 29520 
ctgtgccttt tcttactcct ccctttgtat cccccaatgg gtttcaagag agtccccctg 29580 
gggtactctc tttgcgccta tccgaacctc tagttacctc caatggcatg cttgcgctca 29640 
aaatgggcaa cggcctctct ctggacgagg ccggcaacct tacctcccaa aatgtaacca 29700 
ctgtgagccc acctctcaaa aaaaccaagt caaacataaa cctggaaata tctgcacccc 29760 
tcacagttac ctcagaagcc ctaactgtgg ctgccgccgc acctctaatg gtcgcgggca 29820 
acacactcac catgcaatca caggccccgc taaccgtgca cgactccaaa cttagcattg 29880 
ccacccaagg acccctcaca gtgtcagaag gaaagctagc cctgcaaaca tcaggccccc 29940 
tcaccaccac cgatagcagt acccttacta tcactgcctc accccctcta actactgcca 30000 
ctggtagctt gggcattgac ttgaaagagc ccatttatac acaaaatgga aaactaggac 30060 
taaagtacgg ggctcctttg catgtaacag acgacctaaa cactttgacc gtagcaactg 30120 
gtccaggtgt gactattaat aatacttcct tgcaaactaa agttactgga gccttgggtt 30180 
ttgattcaca aggcaatatg caacttaatg tagcaggagg actaaggatt gattctcaaa 30240 
acagacgcct tatacttgat gttagttatc cgtttgatgc tcaaaaccaa ctaaatctaa 30300 
gactaggaca gggccctctt tttataaact cagcccacaa cttggatatt aactacaaca 30360 
aaggccttta cttgtttaca gcttcaaaca attccaaaaa gcttgaggtt aacctaagca 30420 
ctgccaaggg gttgatgttt gacgctacag ccatagccat taatgcagga gatgggcttg 30480 
aatttggttc acctaatgca ccaaacacaa atcccctcaa aacaaaaatt ggccatggcc 30540 
tagaatttga ttcaaacaag gctatggttc ctaaactagg aactggcctt agttttgaca 30600 
gcacaggtgc cattacagta ggaaacaaaa ataatgataa gctaactttg tggaccacac 30660 
cagctccatc tcctaactgt agactaaatg cagagaaaga tgctaaactc actttggtct 30720 
taacaaaatg tggcagtcaa atacttgcta cagtttcagt tttggctgtt aaaggcagtt 30780 
tggctccaat atctggaaca gttcaaagtg ctcatcttat tataagattt gacgaaaatg 30840 
gagtgctact aaacaattcc ttcctggacc cagaatattg gaactttaga aatggagatc 30900 
ttactgaagg cacagcctat acaaacgctg ttggatttat gcctaaccta tcagcttatc 30960 
caaaatctca cggtaaaact gccaaaagta acattgtcag tcaagtttac ttaaacggag 31020 
acaaaactaa acctgtaaca ctaaccatta cactaaacgg tacacaggaa acaggagaca 31080 
caactccaag tgcatactct atgtcatttt catgggactg gtctggccac aactacatta 31140 
atgaaatatt tgccacatcc tcttacactt tttcatacat tgcccaagaa taaagaatcg 31200 
tttgtgttat gtttcaacgt gtttattttt caattgcaga aaatttcaag tcatttttca 31260 
ttcagtagta tagccccacc accacatagc ttatacagat caccgtacct taatcaaact 31320 
cacagaaccc tagtattcaa cctgccacct ccctcccaac acacagagta cacagtcctt 31380 
tctccccggc tggccttaaa aagcatcata tcatgggtaa cagacatatt cttaggtgtt 31440 
atattccaca cggtttcctg tcgagccaaa cgctcatcag tgatattaat aaactccccg 31500 
ggcagctcac ttaagttcat gtcgctgtcc agctgctgag ccacaggctg ctgtccaact 31560 
tgcggttgct taacgggcgg cgaaggagaa gtccacgcct acatgggggt agagtcataa 31620 
tcgtgcatca ggatagggcg gtggtgctgc agcagcgcgc gaataaactg ctgccgccgc 31680 
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cgctccgtcc tgcaggaata caacatggca gtggtctcct cagcgatgat tcgcaccgcc 31740 

cgcagcataa ggcgccttgt cctccgggca cagcagcgca ccctgatctc acttaaatca 31800 

gcacagtaac tgcagcacag caccacaata ttgttcaaaa tcccacagtg caaggcgctg 31860 

tatccaaagc tcatgg'cggg gaccacagaa cccacgtggc catcatacca caagcgcagg 31920 

tagattaagt ggcgacccct cataaacacg ctggacataa acattacctc ttttggcatg 31980 

ttgtaattca ccacctcccg gtaccatata aacctctgat taaacatggc gccatccacc 32040 

accatcctaa accagctggc caaaacctgc ccgccggcta tacactgcag ggaaccggga 32100 

ctggaacaat gacagtggag agcccaggac tcgtaaccat ggatcatcat gctcgtcatg 32160 

atatcaatgt tggcacaaca caggcacacg tgcatacact tcctcaggat tacaagctcc 32220 

tcccgcgtta gaaccatatc ccagggaaca acccattcct gaatcagcgt aaatcccaca 32280 

ctgcagggaa gacctcgcac gtaactcacg ttgtgcattg tcaaagtgtt acattcgggc 32340 

agcagcggat gatcctccag tatggtagcg cgggtttctg tctcaaaagg aggtagacga 32400 

tccctactgt acggagtgcg ccgagacaac cgagatcgtg ttggtcgtag tgtcatgcca 324 60 

aatggaacgc cggacgtagt catatttcct gaagcaaaac caggtgcggg cgtgacaaac 32520 

agatctgcgt ctccggtctc gccgcttaga tcgctctgtg tagtagttgt agtatatcca 32580 

ctctctcaaa gcatccaggc gccccctggc ttcgggttct atgtaaactc cttcatgcgc 32640 

cgctgccctg ataacatcca ccaccgcaga ataagccaca cccagccaac ctacacattc 32700 

gttctgcgag tcacacacgg gaggagcggg aagagctgga agaaccatgt tttttttttt 32760 

attccaaaag attatccaaa acctcaaaat gaagatctat taagtgaacg cgctcccctc 32820 

cggtggcgtg gtcaaactct acagccaaag aacagataat ggcatttgta agatgttgca 32880 

caatggcttc caaaaggcaa acggccctca cgtccaagtg gacgtaaagg ctaaaccctt 32940 

cagggtgaat ctcctctata aacattccag caccttcaac catgcccaaa taattctcat 33000 

ctcgccacct tctcaatata tctctaagca aatcccgaat attaagtccg gccattgtaa 33060 

aaatctgctc cagagcgccc tccaccttca gcctcaagca gcgaatcatg attgcaaaaa 33120 

ttcaggttcc tcacagacct gtataagatt caaaagcgga acattaacaa aaataccgcg 33180 

atcccgtagg tcccttcgca gggccagctg aacataatcg tgcaggtctg cacggaccag 33240 

cgcggccact tccccgccag gaaccttgac aaaagaaccc acactgatta tgacacgcat 33300 

actcggagct atgctaacca gcgtagcccc gatgtaagct ttgttgcatg ggcggcgata 33360 

taaaatgcaa ggtgctgctc aaaaaatcag gcaaagcctc gcgcaaaaaa gaaagcacat 334 20 

cgtagtcatg ctcatgcaga taaaggcagg taagctccgg aaccaccaca gaaaaagaca 334 80 

ccatttttct ctcaaacatg tctgcgggtt tctgcataaa cacaaaataa aataacaaaa 3354 0 

aaacatttaa acattagaag cctgtcttac aacaggaaaa acaaccctta taagcataag 33600 

acggactacg gccatgccgg cgtgaccgta aaaaaactgg tcaccgtgat taaaaagcac 33660 

caccgacagc tcctcggtca tgtccggagt cataatgtaa gactcggtaa acacatcagg 33720 

ttgattcatc ggtcagtgct aaaaagcgac cgaaatagcc cgggggaata catacccgca 33780 

ggcgtagaga caacattaca gcccccatag gaggtataac aaaattaata ggagagaaaa 33840 

acacataaac acctgaaaaa ccctcctgcc taggcaaaat agcaccctcc cgctccagaa 33900 

caacatacag cgcttcacag cggcagccta acagtcagcc ttaccagtaa aaaagaaaac 33960 

ctattaaaaa aacaccactc gacacggcac cagctcaatc agtcacagtg taaaaaaggg 34020 

ccaagtgcag agcgagtata tataggacta aaaaatgacg taacggttaa agtccacaaa 34080 

aaacacccag aaaaccgcac gcgaacctac gcccagaaac gaaagccaaa aaacccacaa 34140 

cttcctcaaa tcgtcacttc cgttttccca cgttacgtaa cttcccattt taagaaaact 34200 

acaattccca acacatacaa gttactccgc cctaaaacct acgtcacccg ccccgttccc 34260 

acgccccgcg ccacgtcaca aactccaccc cctcattatc atattggctt caatccaaaa 34320 

taaggtatat tattgatgat g 34341 

<210> 3 
<211> 33699 
<212> DNA 

<213> Adenovirus subgroup C 



60 



<400> 3 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 

ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 

gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 

gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 

taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 

agtgaaatct gaataatttt gtgttactca tagcgcgtaa tatttgtcta gggccgcggg 360 

gactttgacc gtttacgtgg agactcgccc aggtgttttt ctcaggtgtt ttccgcgttc 420 

cgggtcaaag ttggcgtttt attattatag tcagctgacg tgtagtgtat ttatacccgg 480 

tgagttcctc aagaggccac tcttgagtgc cagcgagtag agttttctcc tccgagccgc 540 

tccgacaccg ggactgaaaa tgagacatat tatctgccac ggaggtgtta ttaccgaaga 600 

aatggccgcc agtcttttgg accagctgat cgaagaggta ctggctgata atcttccacc 660 

tcctagccat tttgaaccac ctacccttca cgaactgtat gatttagacg tgacggcccc 720 

cgaagatccc aacgaggagg cggtttcgca gatttttccc gactctgtaa tgttggcggt 780 

gcaggaaggg attgacttac tcacttttcc gccggcgccc ggttctccgg agccgcctca 840 
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cctttcccgg cagcccgagc agccggagca 
ccttgtaccg gaggtgatcg atcttacctg 
cgaggatgaa gagggtgagg actttgtgtt 
caggtcttgt cattatcacc ggaggaatac 
ctatatgagg acctgtggca tgtttgtcta 
tagagtggtg ggtttggtgt ggtaattttt 
gaattttgta ttgtgatttt tttaaaaggt 
ccagaaccgg agcctgcaag acctacccgc 
cgcccgacat cacctgtgtc tagagaatgc 
ccttctaaca cacctcctga gatacacccg 
gccgtgagag ttggtgggcg tcgccaggct 
cctgggcaac ctttggactt gagctgtaaa 
ttgcgtgtgt ggttaacgcc tttgtttgct 
gagataatgt ttaacttgca tggcgtgtta 
cgccgtgggc taatcttggt tacatctgac 
ttttctgctg tgcgtaactt gctggaacag 
tttctgtggg gctcatccca ggcaaagtta 
gaatttgaag agcttttgaa atcctgtggt 
caggcgcttt tccaagagaa ggtcatcaag 
gcggctgctg ttgctttttt gagttttata 
agcggggggt acctgctgga ttttctggcc 
aagaatcgcc tgctactgtt gtcttccgtc 
cagcagcagc aggaggaagc caggcggcgg 
gccggcctgg accctcggga atgaatgttg 
gacgcatttt gacaattaca gaggatgggc 
'gggcttgtga ggctacagag gaggctagga 
gtcctgagtg tattactttt caacagatca 
tggcgcagaa gtattccata gagcagctga 
ttgaggaggc tattagggta tatgcaaagg 
tcagcaaact tgtaaatatc aggaattgtt 
agatagatac ggaggatagg gtggccttta 
tgcttggcat ggacggggtg gttattatga 
gtacggtttt cctggccaat accaacctta 
acaatacctg tgtggaagcc tggaccgatg 
gctggaaggg ggtggtgtgt cgccccaaaa 
aaaggtgtac cttgggtatc ctgtctgagg 
ccgactgtgg ttgcttcatg ctagtgaaaa 
gtggcaactg cgaggacagg gcctctcaga 
tgctgaagac cattcacgta gccagccact 
acatactgac ccgctgttcc ttgcatttgg 
aatgcaattt gagtcacact aagatattgc 
tgaacggggt gtttgacatg accatgaaga 
gcaccaggtg cagaccctgc gagtgtggcg 
tggatgtgac cgaggagctg aggcccgatc 
ttggctctag cgatgaagat acagattgag 
tgggaaagaa tatataaggt gggggtctta 
ccgccgccat gagcaccaac tcgtttgatg 
gcatgccccc atgggccggg gtgcgtcaga 
ccgtcctgcc cgcaaactct actaccttga 
agactgcagc ctccgccgcc gcttcagccg 
actttgcttt cctgagcccg cttgcaagca 
acaagttgac ggctcttttg gcacaattgg 
ctcagcagct gttggatctg cgccagcagg 
atgcggttta aaacataaat aaaaaaccag 
cttgctgtct ttatttaggg gttttgcgcg 
cgttgagggt cctgtgtatt ttttccagga 
acatgggcat aagcccgtct ctggggtgga 
gggtggtgtt gtagatgatc cagtcgtagc 
ctttcagtag caagctgatt gccaggggca 
taagctggga tgggtgcata cgtggggata 
tggctatgtt cccagccata tccctccggg 
tgtatccggt gcacttggga aatttgtcat 
tggagacgcc cttgtgacct ccaagatttt 
gcccacgggc ggcggcctgg gcgaagatat 
ccaggatgag atcgtcatag gccattttta 
gtataatggt tccatccggc ccaggggcgt 



gagagccttg ggtccggttt ctatgccaaa 900 
ccacgaggct ggctttccac ccagtgacga 960 
agattatgtg gagcaccccg ggcacggttg 1020 
gggggaccca gatattatgt gttcgctttg 1080 
cagtaagtga aaattatggg cagtgggtga 1140 
tttttaattt ttacagtttt gtggtttaaa 1200 
cctgtgtctg aacctgagcc tgagcccgag 1260 
cgtcctaaaa tggcgcctgc tatcctgaga 1320 
aatagtagta cggatagctg tgactccggt 1380 
gtggtcccgc tgtgccccat taaaccagtt 1440 
gtggaatgta tcgaggactt gcttaacgag 1500 
cgccccaggc cataaggtgt aaacctgtga 1560 
gaatgagttg atgtaagttt aataaagggt 1620 
aatggggcgg ggcttaaagg gtatataatg 1680 
ctcatggagg cttgggagtg tttggaagat 1740 
agctctaaca gtacctcttg gttttggagg 1800 
gtctgcagaa ttaaggagga ttacaagtgg 1860 
gagctgtttg attctttgaa tctgggtcac 1920 
actttggatt tttccacacc ggggcgcgct 1980 
aaggataaat ggagcgaaga aacccatctg 204 0 
atgcatctgt ggagagcggt tgtgagacac 2100 
cgcccggcga taataccgac ggaggagcag 2160 
cggcaggagc agagcccatg gaacccgaga 2220 
tacaggtggc tgaactgtat ccagaactga 2280 
aggggctaaa gggggtaaag agggagcggg 2340 
atctagcttt tagcttaatg accagacacc 2400 
aggataattg cgctaatgag cttgatctgc 2460 
ccacttactg gctgcagcca ggggatgatt 2520 
tggcacttag gccagattgc aagtacaaga 2580 
gctacatttc tgggaacggg gccgaggtgg 2640 
gatgtagcat gataaatatg tggccggggg 2700 
atgtaaggtt tactggcccc aattttagcg 2760 
tcctacacgg tgtaagcttc tatgggttta 2820 
taagggttcg gggctgtgcc ttttactgct 2880 
gcagggcttc aattaagaaa tgcctctttg 2940 
gtaactccag ggtgcgccac aatgtggcct 3000 
gcgtggctgt gattaagcat aacatggtat 3060 
tgctgacctg ctcggacggc aactgtcacc 3120 
ctcgcaaggc ctggccagtg tttgagcata 3180 
gtaacaggag gggggtgttc ctaccttacc 3240 
ttgagcccga gagcatgtcc aaggtgaacc 3300 
tctggaaggt gctgaggtac gatgagaccc 3360 
gtaaacatat taggaaccag cctgtgatgc 3420 
acttggtgct ggcctgcacc cgcgctgagt 3480 
gtactgaaat gtgtgggcgt ggcttaaggg 354 0 
tgtagttttg tatctgtttt gcagcagccg 3600 
gaagcattgt gagctcatat ttgacaacgc 3660 
atgtgatggg ctccagcatt gatggtcgcc 3720 
cctacgagac cgtgtctgga acgccgttgg 3780 
ctgcagccac cgcccgcggg attgtgactg 3840 
gtgcagcttc ccgttcatcc gcccgcgatg 3900 
attctttgac ccgggaactt aatgtcgttt 3960 
tttctgccct gaaggcttcc tcccctccca 4020 
actctgtttg gatttggatc aagcaagtgt 4080 
cgcggtaggc ccgggaccag cggtctcggt 4140 
cgtggtaaag gtgactctgg atgttcagat 4200 
ggtagcacca ctgcagagct tcatgctgcg 4260 
aggagcgctg ggcgtggtgc ctaaaaatgt 4 320 
ggcccttggt gtaagtgttt acaaagcggt 4380 
tgagatgcat cttggactgt atttttaggt 4440 
gattcatgtt gtgcagaacc accagcacag 4500 
gtagcttaga aggaaatgcg tggaagaact 4560 
ccatgcattc gtccataatg atggcaatgg 4620 
ttctgggatc actaacgtca tagttgtgtt 4680 
caaagcgcgg gcggagggtg ccagactgcg 4740 
agttaccctc acagatttgc atttcccacg 4800 



wo 01/04282 



20 



PCT/US00/I8971 



ctttgagttc agatgggggg atcatgtcta cctgcggggc gatgaagaaa acggtttccg 4860 

gggtagggga gatcagctgg gaagaaagca ggttcctgag cagctgcgac ttaccgcagc 4 920 

cggtgggccc gtaaatcaca cctattaccg ggtgcaactg gtagttaaga gagctgcagc 4 980 

tgccgtcatc cctgagcagg ggggccactt cgttaagcat gtccctgact cgcatgtttt 5040 

ccctgaccaa atccgccaga aggcgctcgc cgcccagcga tagcagttct tgcaaggaag 5100 

caaagttttt caacggtttg agaccgtccg ccgtaggcat gcttttgagc gtttgaccaa 5160 

gcagttccag gcggtcccac agctcggtca cctgctctac ggcatctcga tccagcatat 5220 

ctcctcgttt cgcgggttgg ggcggctttc gctgtacggc agtagtcggt gctcgtccag 5280 

acgggccagg gtcatgtctt tccacgggcg cagggtcctc gtcagcgtag tctgggtcac 5340 

ggtgaagggg tgcgctccgg gctgcgcgct ggccagggtg cgcttgaggc tggtcctgct 5400 

ggtgctgaag cgctgccggt cttcgccctg cgcgtcggcc aggtagcatt tgaccatggt 5460 

gtcatagtcc agcccctccg cggcgtggcc cttggcgcgc agcttgccct tggaggaggc 5520 

gccgcacgag gggcagtgca gacttttgag ggcgtagagc ttgggcgcga gaaataccga 5580 

ttccggggag taggcatccg cgccgcaggc cccgcagacg gtctcgcatt ccacgagcca 5640 

ggtgagctct ggccgttcgg ggtcaaaaac caggtttccc ccatgctttt tgatgcgttt 5700 

cttacctctg gtttccatga gccggtgtcc acgctcggtg acgaaaaggc tgtccgtgtc 5760 

cccgtataca gacttgagag gcctgtcctc gagcggtgtt ccgcggtcct cctcgtatag 5820 

aaactcggac cactctgaga caaaggctcg cgtccaggcc agcacgaagg aggctaagtg 5880 

ggaggggtag cggtcgttgt ccactagggg gtccactcgc tccagggtgt gaagacacat 594 0 

gtcgccctct tcggcatcaa ggaaggtgat tggtttgtag gtgtaggcca cgtgaccggg 6000 

tgttcctgaa ggggggctat aaaagggggt gggggcgcgt tcgtcctcac tctcttccgc 6060 

atcgctgtct gcgagggcca gctgttgggg tgagtactcc ctctgaaaag cgggcatgac 6120 

ttctgcgcta agattgtcag tttccaaaaa cgaggaggat ttgatattca cctggcccgc 6180 

ggtgatgcct ttgagggtgg ccgcatccat ctggtcagaa aagacaatct ttttgttgtc 6240 

aagcttggtg gcaaacgacc cgtagagggc gttggacagc aacttggcga tggagcgcag 6300 

ggtttggttt ttgtcgcgat cggcgcgctc cttggccgcg atgtttagct gcacgtattc 6360 

gcgcgcaacg caccgccatt cgggaaagac ggtggtgcgc tcgtcgggca ccaggtgcac 6420 

gcgccaaccg cggttgtgca gggtgacaag gtcaacgctg gtggctacct ctccgcgtag 6480 

gcgctcgttg gtccagcaga ggcggccgcc cttgcgcgag cagaatggcg gtagggggtc 6540 

tagctgcgtc tcgtccgggg ggtctgcgtc cacggtaaag accccgggca gcaggcgcgc 6600 

gtcgaagtag tctatcttgc atccttgcaa gtctagcgcc tgctgccatg cgcgggcggc 6660 

aagcgcgcgc tcgtatgggt tgagtggggg accccatggc atggggtggg tgagcgcgga 6720 

ggcgtacatg ccgcaaatgt cgtaaacgta gaggggctct ctgagtattc caagatatgt 6780 

agggtagcat cttccaccgc ggatgctggc gcgcacgtaa tcgtatagtt cgtgcgaggg 6840 

agcgaggagg tcgggaccga ggttgctacg ggcgggctgc tctgctcgga agactatctg 6900 

cctgaagatg gcatgtgagt tggatgatat ggttggacgc tggaagacgt tgaagctggc 6960 

gtctgtgaga cctaccgcgt cacgcacgaa ggaggcgtag gagtcgcgca gcttgttgac 7020 

cagctcggcg gtgacctgca cgtctagggc gcagtagtcc agggtttcct tgatgatgtc 7080 

atacttatcc tgtccctttt ttttccacag ctcgcggttg aggacaaact cttcgcggtc 7140 

tttccagtac tcttggatcg gaaacccgtc ggcctccgaa cggtaagagc ctagcatgta 7200 

gaactggttg acggcctggt aggcgcagca tcccttttct acgggtagcg cgtatgcctg 7260 

cgcggccttc cggagcgagg tgtgggtgag cgcaaaggtg tccctgacca tgactttgag 7320 

gtactggtat ttgaagtcag tgtcgtcgca tccgccctgc tcccagagca aaaagtccgt 7380- 

gcgctttttg gaacgcggat ttggcagggc gaaggtgaca tcgttgaaga gtatctttcc 7440 

cgcgcgaggc ataaagttgc gtgtgatgcg gaagggtccc ggcacctcgg aacggttgtt 7500 

aattacctgg gcggcgagca cgatctcgtc aaagccgttg atgttgtggc ccacaatgta 7560 

aagttccaag aagcgcggga tgcccttgat ggaaggcaat tttttaagtt cctcgtaggt 7620 

gagctcttca ggggagctga gcccgtgctc tgaaagggcc cagtctgcaa gatgagggtt 7 680 

ggaagcgacg aatgagctcc acaggtcacg ggccattagc atttgcaggt ggtcgcgaaa 7740 

ggtcctaaac tggcgaccta tggccatttt ttctggggtg atgcagtaga aggtaagcgg 7800 

gtcttgttcc cagcggtccc atccaaggtt cgcggctagg tctcgcgcgg cagtcactag 7860 

aggctcatct ccgccgaact tcatgaccag catgaagggc acgagctgct tcccaaaggc 7920 

ccccatccaa gtataggtct ctacatcgta ggtgacaaag agacgctcgg tgcgaggatg 7980 

cgagccgatc gggaagaact ggatctcccg ccaccaattg gaggagtggc tattgatgtg 8040 

gtgaaagtag aagtccctgc gacgggccga acactcgtgc tggcttttgt aaaaacgtgc 8100 

gcagtactgg cagcggtgca cgggctgtac atcctgcacg aggttgacct gacgaccgcg 8160 

cacaaggaag cagagtggga atttgagccc ctcgcctggc gggtttggct ggtggtcttc 8220 

tacttcggct gcttgtcctt gaccgtctgg ctgctcgagg ggagttacgg tggatcggac 8280 

caccacgccg cgcgagccca aagtccagat gtccgcgcgc ggcggtcgga gcttgatgac 8340 

aacatcgcgc agatgggagc tgtccatggt ctggagctcc cgcggcgtca ggtcaggcgg 8400 

gagctcctgc aggtttacct cgcatagacg ggtcagggcg cgggctagat ccaggtgata 8460 

cctaatttcc aggggctggt tggtggcggc gtcgatggct tgcaagaggc cgcatccccg 8520 

cggcgcgact acggtaccgc gcggcgggcg gtgggccgcg ggggtgtcct tggatgatgc 8580 

atctaaaagc ggtgacgcgg gcgagccccc ggaggtaggg ggggctccgg acccgccggg 8640 

agagggggca ggggcacgtc ggcgccgcgc gcgggcagga gctggtgctg cgcgcgtagg 8700 

ttgctggcga acgcgacgac gcggcggttg atctcctgaa tctggcgcct ctgcgtgaag 8760 
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acgacgggcc cggtgagctt gagcctgaaa gagagttcga cagaatcaat ttcggtgtcg 8820 
ttgacggcgg cctggcgcaa aatctcctgc acgtctcctg agttgtcttg ataggcgatc 8880 
tcggccatga actgctcgat ctcttcctcc tggagatctc cgcgtccggc tcgctccacg 8940 
gtggcggcga ggtcgttgga aatgcgggcc atgagctgcg agaaggcgtt gaggcctccc 9000 
tcgttccaga cgcggctgta gaccacgccc ccttcggcat cgcgggcgcg catgaccacc 9060 
tgcgcgagat tgagctccac gtgccgggcg aagacggcgt agtttcgcag gcgctgaaag 9120 
aggtagttga gggtggtggc ggtgtgttct gccacgaaga agtacataac ccagcgtcgc 9180 
aacgtggatt cgttgatatc ccccaaggcc tcaaggcgct ccatggcctc gtagaagtcc 9240 
acggcgaagt tgaaaaactg ggagttgcgc gccgacacgg ttaactcctc ctccagaaga 9300 
cggatgagct cggcgacagt gtcgcgcacc tcgcgctcaa aggctacagg ggcctcttct 9360 
tcttcttcaa tctcctcttc cataagggcc tccccttctt cttcttctgg cggcggtggg 9420 
ggagggggga cacggcggcg acgacggcgc accgggaggc ggtcgacaaa gcgctcgatc 9480 
atctccccgc ggcgacggcg catggtctcg gtgacggcgc ggccgttctc gcgggggcgc 9540 
agttggaaga cgccgcccgt catgtcccgg ttatgggttg gcggggggct gccatgcggc 9600 
agggatacgg cgctaacgat gcatctcaac aattgttgtg taggtactcc gccgccgagg 9660 
gacctgagcg agtccgcatc gaccggatcg gaaaacctct cgagaaaggc gtctaaccag 9720 
tcacagtcgc aaggtaggct gagcaccgtg gcgggcggca gcgggcggcg gtcggggttg 9780 
tttctggcgg aggtgctgct gatgatgtaa ttaaagtagg cggtcttgag acggcggatg 9840 
gtcgacagaa gcaccatgtc cttgggtccg gcctgctgaa tgcgcaggcg gtcggccatg 9900 
ccccaggctt cgttttgaca tcggcgcagg tctttgtagt agtcttgcat gagcctttct 9960 
accggcactt cttcttctcc ttcctcttgt cctgcatctc ttgcatctat cgctgcggcg 10020 
gcggcggagt ttggccgtag gtggcgccct cttcctccca tgcgtgtgac cccgaagccc 10080 
ctcatcggct gaagcagggc taggtcggcg acaacgcgct cggctaatat ggcctgctgc 1014 0 
acctgcgtga gggtagactg gaagtcatcc atgtccacaa agcggtggta tgcgcccgtg 10200 
ttgatggtgt aagtgcagtt ggccataacg gaccagttaa cggtctggtg acccggctgc 10260 
gagagctcgg tgtacctgag acgcgagtaa gccctcgagt caaatacgta gtcgttgcaa 10320 
gtccgcacca ggtactggta tcccaccaaa aagtgcggcg gcggctggcg gtagaggggc 10380 
cagcgtaggg tggccggggc tccgggggcg agatcttcca acataaggcg atgatatccg 10440 
tagatgtacc tggacatcca ggtgatgccg gcggcggtgg tggaggcgcg cggaaagtcg 10500 
cggacgcggt tccagatgtt gcgcagcggc aaaaagtgct ccatggtcgg gacgctctgg 10560 
ccggtcaggc gcgcgcaatc gttgacgctc tagaccgtgc aaaaggagag cctgtaagcg 10620 
ggcactcttc cgtggtctgg tggataaatt cgcaagggta tcatggcgga cgaccggggt 10680 
tcgagccccg tatccggccg tccgccgtga tccatgcggt taccgcccgc gtgtcgaacc 10740 
caggtgtgcg acgtcagaca acgggggagt gctccttttg gcttccttcc aggcgcggcg 10800 
gctgctgcgc tagctttttt ggccactggc cgcgcgcagc gtaagcggtt aggctggaaa 10860 
gcgaaagcat taagtggctc gctccctgta gccggagggt tattttccaa gggttgagtc 10920 
gcgggacccc cggttcgagt ctcggaccgg ccggactgcg gcgaacgggg gtttgcctcc 10980 
ccgtcatgca agaccccgct tgcaaattcc tccggaaaca gggacgagcc ccttttttgc 11040 
ttttcccaga tgcatccggt gctgcggcag atgcgccccc ctcctcagca gcggcaagag 11100 
caagagcagc ggcagacatg cagggcaccc tcccctcctc ctaccgcgtc aggaggggcg 11160 
acatccgcgg ttgacgcggc agcagatggt gattacgaac ccccgcggcg ccgggcccgg 11220 
cactacctgg acttggagga gggcgagggc ctggcgcggc taggagcgcc ctctcctgag 11280 
cggtacccaa gggtgcagct gaagcgtgat acgcgtgagg cgtacgtgcc gcggcagaac 11340 
ctgtttcgcg accgcgaggg agaggagccc gaggagatgc gggatcgaaa gttccacgca 11400 
gggcgcgagc tgcggcatgg cctgaatcgc gagcggttgc tgcgcgagga ggactttgag 114 60 
cccgacgcgc gaaccgggat tagtcccgcg cgcgcacacg tggcggccgc cgacctggta 11520 
accgcatacg agcagacggt gaaccaggag attaactttc aaaaaagctt taacaaccac 11580 
gtgcgtacgc ttgtggcgcg cgaggaggtg gctataggac tgatgcatct gtgggacttt 11640 
gtaagcgcgc tggagcaaaa cccaaatagc aagccgctca tggcgcagct gttccttata 11700 
gtgcagcaca gcagggacaa cgaggcattc agggatgcgc tgctaaacat agtagagccc 11760 
gagggccgct ggctgctcga tttgataaac atcctgcaga gcatagtggt gcaggagcgc 11820 
agcttgagcc tggctgacaa ggtggccgcc atcaactatt ccatgcttag cctgggcaag 11880 
ttttacgccc gcaagatata ccatacccct tacgttccca tagacaagga ggtaaagatc 11940 
gaggggttct acatgcgcat ggcgctgaag gtgcttacct tgagcgacga cctgggcgtt 12000 
tatcgcaacg agcgcatcca caaggccgtg agcgtgagcc ggcggcgcga gctcagcgac 12060 
cgcgagctga tgcacagcct gcaaagggcc ctggctggca cgggcagcgg cgatagagag 12120 
gccgagtcct actttgacgc gggcgctgac ctgcgctggg ccccaagccg acgcgccctg 12180 
gaggcagctg gggccggacc tgggctggcg gtggcacccg cgcgcgctgg caacgtcggc 12240 
ggcgtggagg aatatgacga ggacgatgag tacgagccag aggacggcga gtactaagcg 12300 
gtgatgtttc tgatcagatg atgcaagacg caacggaccc ggcggtgcgg gcggcgctgc 12360 
agagccagcc gtccggcctt aactccacgg acgactggcg ccaggtcatg gaccgcatca 12420 
tgtcgctgac tgcgcgcaat cctgacgcgt tccggcagca gccgcaggcc aaccggctct 12480 
ccgcaattct ggaagcggtg gtcccggcgc gcgcaaaccc cacgcacgag aaggtgctgg 12540 
cgatcgtaaa cgcgctggcc gaaaacaggg ccatccggcc cgacgaggcc ggcctggtct 12600 
acgacgcgct gcttcagcgc gtggctcgtt acaacagcgg caacgtgcag accaacctgg 12660 
accggctggt gggggatgtg cgcgaggccg tggcgcagcg tgagcgcgcg cagcagcagg 12720 
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gcaacctggg ctccatggtt gcactaaacg ccttcctgag tacacagccc gccaacgtgc 12780 
cgcggggaca ggaggactac accaactttg tgagcgcact gcggctaatg gtgactgaga 12840 
caccgcaaag tgaggtgtac cagtctgggc cagactattt tttccagacc agtagacaag 12900 
gcctgcagac cgtaaacctg agccaggctt tcaaaaactt gcaggggctg tggggggtgc 12960 
gggctcccac aggcgaccgc gcgaccgtgt ctagcttgct gacgcccaac tcgcgcctgt 13020 
tgctgctgct aatagcgccc ttcacggaca gtggcagcgt gccccgggac acatacctag 13080 
gtcacttgct gacactgtac cgcgaggcca taggtcaggc gcatgtggac gagcatactt 13140 
tccaggagat tacaagtgtc agccgcgcgc tggggcagga ggacacgggc agcctggagg 13200 
caaccctaaa ctacctgctg accaaccggc ggcagaagat cccctcgttg cacagtttaa 13260 
acagcgagga ggagcgcatt ttgcgctacg tgcagcagag cgtgagcctt aacctgatgc 13320 
gcgacggggt aacgcccagc gtggcgctgg acatgaccgc gcgcaacatg gaaccgggca 13380 
tgtatgcctc aaaccggccg tttatcaacc gcctaatgga ctacttgcat cgcgcggccg 13440 
ccgtgaaccc cgagtatttc accaatgcca tcttgaaccc gcactggcta ccgccccctg 13500 
gtttctacac cgggggattc gaggtgcccg agggtaacga tggattcctc tgggacgaca 13560 
tagacgacag cgtgttttcc ccgcaaccgc agaccctgct agagttgcaa cagcgcgagc 13620 
aggcagaggc ggcgctgcga aaggaaagct tccgcaggcc aagcagcttg tccgatctag 13680 
gcgctgcggc cccgcggtca gatgctagta gcccatttcc aagcttgata gggtctctta 13740 
ccagcactcg caccacccgc ccgcgcctgc tgggcgagga ggagtaccta aacaactcgc 13800 
tgctgcagcc gcagcgcgaa aaaaacctgc ctccggcatt tcccaacaac gggatagaga 13860 
gcctagtgga caagatgagt agatggaaga cgtacgcgca ggagcacagg gacgtgccag 13920 
gcccgcgccc gcccacccgt cgtcaaaggc acgaccgtca gcggggtctg gtgtgggagg 13980 
acgatgactc ggcagacgac agcagcgtcc tggatttggg agggagtggc aacccgtttg 14040 
cgcaccttcg ccccaggctg gggagaatgt tttaaaaaaa aaaaagcatg atgcaaaata 14100 
aaaaactcac caaggccatg gcaccgagcg ttggttttct tgtattcccc ttagtatgcg 14160 
gcgcgcggcg atgtatgagg aaggtcctcc tccctcctac gagagtgtgg tgagcgcggc 14220 
gccagtggcg gcggcgctgg gttctccctt cgatgctccc ctggacccgc cgtttgtgcc 14280 
tccgcggtac ctgcggccta ccggggggag aaacagcatc cgttactctg agttggcacc 14340 
cctattcgac accacccgtg tgtacctggt ggacaacaag tcaacggatg tggcatccct 14400 
gaactaccag aacgaccaca gcaactttct gaccacggtc attcaaaaca atgactacag 14460 
cccgggggag gcaagcacac agaccatcaa tcttgacgac cggtcgcact ggggcggcga 14520 
cctgaaaacc atcctgcata ccaacatgcc aaatgtgaac gagttcatgt ttaccaataa 14580 
gtttaaggcg cgggtgatgg tgtcgcgctt gcctactaag gacaatcagg tggagctgaa 14640 
atacgagtgg gtggagttca cgctgcccga gggcaactac tccgagacca tgaccataga 14700 
ccttatgaac aacgcgatcg tggagcacta cttgaaagtg ggcagacaga acggggttct 14760 
ggaaagcgac atcggggtaa agtttgacac ccgcaacttc agactggggt ttgaccccgt 14820 
cactggtctt gtcatgcctg gggtatatac aaacgaagcc ttccatccag acatcatttt 14880 
gctgccagga tgcggggtgg acttcaccca cagccgcctg agcaacttgt tgggcatccg 14940 
caagcggcaa cccttccagg agggctttag gatcacctac gatgatctgg agggtggtaa 15000 
cattcccgca ctgttggatg tggacgccta ccaggcgagc ttgaaagatg acaccgaaca 15060 
gggcgggggt ggcgcaggcg gcagcaacag cagtggcagc ggcgcggaag agaactccaa 15120 
cgcggcagcc gcggcaatgc agccggtgga ggacatgaac gatcatgcca ttcgcggcga 15180 
cacctttgcc acacgggctg aggagaagcg cgctgaggcc gaagcagcgg ccgaagctgc 15240 
cgcccccgct gcgcaacccg aggtcgagaa gcctcagaag aaaccggtga tcaaacccct 15300 
gacagaggac agcaagaaac gcagttacaa cctaataagc aatgacagca ccttcaccca 15360 
gtaccgcagc tggtaccttg catacaacta cggcgaccct cagaccggaa tccgctcatg 15420 
gaccctgctt tgcactcctg acgtaacctg cggctcggag caggtctact ggtcgttgcc 15480 
agacatgatg caagaccccg tgaccttccg ctccacgcgc cagatcagca actttccggt 15540 
ggtgggcgcc gagctgttgc ccgtgcactc caagagcttc tacaacgacc aggccgtcta 15600 
ctcccaactc atccgccagt ttacctctct gacccacgtg ttcaatcgct ttcccgagaa 15660 
ccagattttg gcgcgcccgc cagcccccac catcaccacc gtcagtgaaa acgttcctgc 15720 
tctcacagat cacgggacgc taccgctgcg caacagcatc ggaggagtcc agcgagtgac 157 80 
cattactgac gccagacgcc gcacctgccc ctacgtttac aaggccctgg gcatagtctc 15840 
gccgcgcgtc ctatcgagcc gcactttttg agcaagcatg tccatcctta tatcgcccag 15900 
caataacaca ggctggggcc tgcgcttccc aagcaagatg tttggcgggg ccaagaagcg 15960 
ctccgaccaa cacccagtgc gcgtgcgcgg gcactaccgc gcgccctggg gcgcgcacaa 16020 
acgcggccgc actgggcgca ccaccgtcga tgacgccatc gacgcggtgg tggaggaggc 16080 
gcgcaactac acgcccacgc cgccaccagt gtccacagtg gacgcggcca ttcagaccgt 16140 
ggtgcgcgga gcccggcgct atgctaaaat gaagagacgg cggaggcgcg tagcacgtcg 16200 
ccaccgccgc cgacccggca ctgccgccca acgcgcggcg gcggccctgc ttaaccgcgc 16260 
acgtcgcacc ggccgacggg cggccatgcg ggccgctcga aggctggccg cgggtattgt 16320 
cactgtgccc cccaggtcca ggcgacgagc ggccgccgca gcagccgcgg ccattagtgc 16380 
tatgactcag ggtcgcaggg gcaacgtgta ttgggtgcgc gactcggtta gcggcctgcg 16440 
cgtgcccgtg cgcacccgcc ccccgcgcaa ctagattgca agaaaaaact acttagactc 16500 
gtactgttgt atgtatccag cggcggcggc gcgcaacgaa gctatgtcca agcgcaaaat 16560 
caaagaagag atgctccagg tcatcgcgcc ggagatctat ggccccccga agaaggaaga 16620 
gcaggattac aagccccgaa agctaaagcg ggtcaaaaag aaaaagaaag atgatgatga 16680 
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tgaacttgac gacgaggtgg aactgctgca 
gaaaggtcga cgcgtaaaac gtgttttgcg 
tgagcgctcc acccgcacct acaagcgcgt 
gcttgagcag gccaacgagc gcctcgggga 
gctggcgttg ccgctggacg agggcaaccc 
gcaggtgctg cccgcgcttg caccgtccga 
tgacttggca cccaccgtgc agctgatggt 
ggaaaaaatg accgtggaac ctgggctgga 
ggtggcgccg ggactgggcg tgcagaccgt 
cagtattgcc accgccacag agggcatgga 
ggcggatgcc gcggtgcagg cggtcgctgc 
aacggacccg tggatgtttc gcgtttcagc 
cggcgccgcc agcgcgctac tgcccgaata 
cggctatcgt ggctacacct accgccccag 
cactggaacc cgccgccgcc gtcgccgtcg 
cagggtggct cgcgaaggag gcaggaccct 
catcgtttaa aagccggtct ttgtggttct 
tttcccggtg ccgggattcc gaggaagaat 
cctgacgggc ggcatgcgtc gtgcgcacca 
gcgcggcggt atcctgcccc tccttattcc 
cggaattgca tccgtggcct tgcaggcgca 
gaaaaatcaa aataaaaagt ctggactctc 
gaatggaaga catcaacttt gcgtctctgg 
gaaactggca agatatcggc accagcaata 
tgtggagcgg cattaaaaat ttcggttcca 
acagcagcac aggccagatg ctgagggata 
tggtagatgg cctggcctct ggcattagcg 
aaaataagat taacagtaag cttgatcccc 
tggagacagt gtctccagag gggcgtggcg 
ctctggtgac gcaaatagac gagcctccct 
ccaccacccg tcccatcgcg cccatggcta 
cgctggacct gcctcccccc gccgacaccc 
ccgttgttgt aacccgtcct agccgcgcgt 
cgttgcggcc cgtagccagt ggcaactggc 
gggtgcaatc cctgaagcgc cgacgatgct 
atgtatgcgt ccatgtcgcc gccagaggag 
gatggctacc ccttcgatga tgccgcagtg 
ctcggagtac ctgagccccg ggctggtgca 
cctgaataac aagtttagaa accccacggt 
gtcccagcgt ttgacgctgc ggttcatccc 
caaggcgcgg ttcaccctag ctgtgggtga 
ctttgacatc cgcggcgtgc tggacagggg 
ctacaacgcc ctggctccca agggtgcccc 
tgctcttgaa ataaacctag aagaagagga 
agctgagcag caaaaaactc acgtatttgg 
aaaggagggt attcaaatag gtgtcgaagg 
tcaacctgaa cctcaaatag gagaatctca 
tgggagagtc cttaaaaaga ctaccccaat 
cacaaatgaa aatggagggc aaggcattct 
tcaagtggaa atgcaatttt tctcaactac 
gactcctaaa gtggtattgt acagtgaaga 
ttcttacatg cccactatta aggaaggtaa 
gcccaacagg cctaattaca ttgcttttag 
cagcacgggt aatatgggtg ttctggcggg 
tttgcaagac agaaacacag agctttcata 
aaccaggtac ttttctatgt ggaatcaggc 
tattgaaaat catggaactg aagatgaact 
gattaataca gagactctta ccaaggtaaa 
aaaagatgct acagaatttt cagataaaaa 
ggaaatcaat ctaaatgcca acctgtggag 
tttgcccgac aagctaaagt acagtccttc 
ctacgactac atgaacaagc gagtggtggc 
tggagcacgc tggtcccttg actatatgga 
tgctggcctg cgctaccgcc caatgttgct 
ccaggtgcct cagaagttct ttgccattaa 
ctacgagtgg aacttcagga aggatgttaa 



cgctaccgcg cccaggcgac gggtacagtg 16740 
acccggcacc accgtagtct ttacgcccgg 16800 
gtatgatgag gtgtacggcg acgaggacct 16860 
gtttgcctac ggaaagcggc ataaggacat 1-6920 
aacacctagc ctaaagcccg taacactgca 16980 
agaaaagcgc ggcctaaagc gcgagtctgg 17040 
acccaagcgc cagcgactgg aagatgtctt 17100 
gcccgaggtc cgcgtgcggc caatcaagca 17160 
ggacgttcag atacccacta ccagtagcac 17220 
gacacaaacg tccccggttg cctcagcggt 17280 
ggccgcgtcc aagacctcta cggaggtgca 17340 
cccccggcgc ccgcgcggtt cgaggaagta 17400 
tgccctacat ccttccattg cgcctacccc 174 60 
aagacgagca actacccgac gccgaaccac 17S20 
ccagcccgtg ctggccccga tttccgtgcg 17580 
ggtgctgcca acagcgcgct accaccccag 17640 
tgcagatatg gccctcacct gccgcctccg 17700 
gcaccgtagg aggggcatgg ccggccacgg 17760 
ccggcggcgg cgcgcgtcgc accgtcgcat 17820 
actgatcgcc gcggcgattg gcgccgtgcc 17880 
gagacactga ttaaaaacaa gttgcatgtg 17940 
acgctcgctt ggtcctgtaa ctattttgta 18000 
ccccgcgaca cggctcgcgc ccgttcatgg 18060 
tgagcggtgg cgccttcagc tggggctcgc 18120 
ccgttaagaa ctatggcagc aaggcctgga 18180 
agttgaaaga gcaaaatttc caacaaaagg 18240 
gggtggtgga cctggccaac caggcagtgc 18300 
gccctcccgt agaggagcct ccaccggccg 18360 
aaaagcgtcc gcgccccgac agggaagaaa 184 20 
cgtacgagga ggcactaaag caaggcctgc 184 80 
ccggagtgct gggccagcac acacccgtaa 18540 
agcagaaacc tgtgctgcca ggcccgaccg 18600 
ccctgcgccg cgccgccagc ggtccgcgat 18660 
aaagcacact gaacagcatc gtgggtctgg 18720 
tctgaatagc taacgtgtcg tatgtgtgtc 18780 
ctgctgagcc gccgcgcgcc cgctttccaa 18840 
gtcttacatg cacatctcgg gccaggacgc 18900 
gtttgcccgc gccaccgaga cgtacttcag 18960 
ggcgcctacg cacgacgtga ccacagaccg 19020 
tgtggaccgt gaggatactg cgtactcgta 19080 
taaccgtgtg ctggacatgg cttccacgta 19140 
ccctactttt aagccctact ctggcactgc 19200 
aaatccttgc gaatgggatg aagctgctac 19260 
cgatgacaac gaagacgaag tagacgagca 19320 
gcaggcgcct tattctggta taaatattac 19380 
tcaaacacct aaatatgccg ataaaacatt 194 40 
gtggtacgaa actgaaatta atcatgcagc 19500 
gaaaccatgt tacggttcat atgcaaaacc 19560 
tgtaaagcaa caaaatggaa agctagaaag 19620 
tgaggcgacc gcaggcaatg gtgataactt 19680 
tgtagatata gaaaccccag acactcatat 19740 
ctcacgagaa ctaatgggcc aacaatctat 19800 
ggacaatttt attggtctaa tgtattacaa 198 60 
ccaagcatcg cagttgaatg ctgttgtaga 19920 
ccagcttttg cttgattcca ttggtgatag 19980 
tgttgacagc tatgatccag atgttagaat 20040 
tccaaattac tgctttccac tgggaggtgt 20100 
acctaaaaca ggtcaggaaa atggatggga 20160 
tgaaataaga gttggaaata attttgccat 20220 
aaatttcctg tactccaaca tagcgctgta 20280 
caacgtaaaa atttctgata acccaaacac 20340 
tcccgggtta gtggactgct acattaacct 204 00 
caacgtcaac ccatttaacc accaccgcaa 204 60 
gggcaatggt cgcratgtgc ccttccacat 20520 
aaacctcctt ctcctgccgg gctcatacac 20580 
catggttctg cagagctccc taggaaatga 20640 
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cctaagggtt 

ccccatggcc 

ccagtccttt 

taccaacgtg 

cacgcgcctt 

ctactctggc 

ggtggccatt 

caacgagttt 

catgaccaaa 

cttctatatc 

catgagccgt 

acaccaacac 

ggcctaccct 

ccagaaaaag 

gtccatgggc 

gctagacatg 

tgaagtcttt 

cctgcgcacg 

acagctgccg 

tgtgggccat 

aagctcgcct 

gcctttgcct 

gaccagcgac 

attgcttctt 

cccaactcgg 

ccccaaactc 

atgctcaaca 

ttcctggagc 

tctttttgtc 

aaatgctttt 

gtttaaaaat 

cgatactggt 

aagttttcac 

atcttgaagt 

cagcactgga 

atcagatccg 

tgccttccca 

aaaaggtgac 

tgcttaaaag 

gaaaactgat 

atctgcacca 

ttcagcgcgc 

atcataatgc 

cacaacgcgc 

tacgcctgca 

tgcaacccgc 

tggtcaggca 

agcgcgcgcg 

ttcatcaccg 

ataccacgcg 

ccatgcttga 

ctttcttcct 

gggcgcttct 

gggctgggtg 

atacgccgcc 

gacgacacgt 

gtttcgcgct 

atggagtcag 

tccaccgatg 

gaggaagtga 

gtaccaacag 

gggcgggggg 

catctgcagc 

ctcgccatag 

cccaaacgcc 

tttgccgtgc 



gacggagcca 

cacaacaccg 

aacgactatc 

cccatatcca 

aagactaagg 

tctataccct 

acctttgact 

gaaattaagc 

gactggttcc 

ccagagagct 

caggtggtgg 

aacaactctg 

gctaacttcc 

tttctttgcg 

gcactcacag 

acttttgagg 

gacgtggtcc 

cccttctcgg 

ccatgggctc 

attttttggg 

gcgccatagt 

ggaacccgca 

tcaagcaggt 

cccccgaccg 

ccgcctgtgg 

ccatggatca 

gtccccaggt 

gccactcgcc 

acttgaaaaa 

atttgtacac 

caaaggggtt 

gtttagtgct 

tccacaggct 

cgcagttggg 

acactatcag 

cgtccaggtc 

aaaagggcgc 

cgtgcccggt 

ccacctgagc 

tggccggaca 

catttcggcc 

gctgcccgtt 

ttccgtgtag 

agcccgtggg 

ggaatcgccc 

ggtgctcctc 

gtagtttgaa 

cagcctccat 

taatttcact 

ccactgggtc 

ttagcaccgg 

cgctgtccac 

ttttcttctt 

tgcgcggcac 

tcatccgctt 

cctccatggt 

gctcctcttc 

tcgagaagaa 

ccgccaacgc 

ttatcgagca 

aggataaaaa 

acgaaaggca 

gccagtgcgc 

cggatgtcag 

aagaaaacgg 

cagaggtgct 



gcattaagtt 
cctccacgct 
tctccgccgc 
tcccctcccg 
aaaccccatc 
acctagatgg 
cttctgtcag 
gctcagttga 
tggtacaaat 
acaaggaccg 
atgatactaa 
gatttgttgg 
cctatccgct 
atcgcaccct 
acctgggcca 
tggatcccat 
gtgtgcaccg 
ccggcaacgc 
cagtgagcag 
cacctatgac 
caatacggcc 
ctcaaaaaca 
ttaccagttt 
ctgtataacg 
actattctgc 
caaccccacc 
acagcccacc 
ctacttccgc 
catgtaaaaa 
tctcgggtga 
ctgccgcgca 
ccacttaaac 
gcgcaccatc 
gcctccgccc 
cgccgggtgg 
ctccgcgttg 
gtgcccaggc 
ctgggcgtta 
ctttgcgcct 
ggccgcgtcg 
ccaccggttc 
ttcgctcgtc 
acacttaagc 
ctcgtgatgc 
catcatcgtc 
gttcagccag 
gttcgccttt 
gcccttctcc 
ttccgcttcg 
gtcttcattc 
tgggttgctg 
gattacctct 
gggcgcaatg 
cagcgcgtct 
ttttgggggc 
tgggggacgt 
ccgactggcc 
ggacagccta 
gcctaccacc 
ggacccaggt 
gcaagaccag 
tggcgactac 
cattatctgc 
ccttgcctac 
cacatgcgag 
tgccacctat 



tgatagcatt 
tgaggccatg 
caacatgctc 
caactgggcg 
actgggctcg 
aaccttttac 
ctggcctggc 
cggggagggt 
gctagctaac 
catgtactcc 
atacaaggac 
ctaccttgcc 
tataggcaag 
ttggcgcatc 
aaaccttctc 
ggacgagccc 
gccgcaccgc 
cacaacataa 
gaactgaaag 
aagcgctttc 
ggtcgcgaga 
tgctacctct 
gagtacgagt 
ctggaaaagt 
tgcatgtttc 
atgaacctta 
ctgcgtcgca 
agccacagtg 
taatgtacta 
ttatttaccc 
tcgctatgcg 
tcaggcacaa 
accaacgcgt 
tgcgcgcgcg 
tgcacgctgg 
ctcagggcga 
tttgagttgc 
ggatacagcg 
tcagagaaga 
tgcacgcagc 
ttcacgatct 
acatccattt 
tcgccttcga 
ttgtaggtca 
acaaaggtct 
gtcttgcata 
agatcgttat 
cacgcagaca 
ctgggctctt 
agccgccgca 
aaacccacca 
ggtgatggcg 
gccaaatccg 
tgtgatgagt 
gcccggggag 
cgcgccgcac 
atttccttct 
accgccccct 
ttccccgtcg 
tttgtaagcg 
gacaacgcag 
ctagatgtgg 
gacgcgttgc 
gaacgccacc 
cccaacccgc 
cacatctttt 



tgcctttacg 
cttagaaacg 
taccctatac 
gctttccgcg 
ggctacgacc 
ctcaaccaca 
aatgaccgcc 
tacaacgttg 
tacaacattg 
ttctttagaa 
taccaacagg 
cccaccatgc 
accgcagttg 
ccattctcca 
tacgccaact 
acccttcttt 
ggcgtcatcg 
agaagcaagc 
ccattgtcaa 
caggctttgt 
ctgggggcgt 
ttgagccctt 
cactcctgcg 
ccacccaaag 
tccacgcctt 
ttaccggggt 
accaggaaca 
cgcagattag 
gagacacttt 
ccacccttgc 
ccactggcag 
ccatccgcgg 
ttagcaggtc 
agttgcgata 
ccagcacgct 
acggagtcaa 
actcgcaccg 
cctgcataaa 
acatgccgca 
accttgcgtc 
tggccttgct 
caatcacgtg 
tctcagcgca 
cctctgcaaa 
tgttgctggt 
cggccgccag 
ccacgtggta 
cgatcggcac 
cctcttcctc 
ctgtgcgctt 
tttgtagcgc 
ggcgctcggg 
ccgccgaggt 
cttcctcgtc 
gcggcggcga 
cgcgtccgcg 
cctataggca 
ctgagttcgc 
aggcaccccc 
aagacgacga 
aggcaaacga 
gagacgacgt 
aagagcgcag 
tattctcacc 
gcctcaactt 
tccaaaactg 



ccaccttctt 
acaccaacga 
ccgccaacgc 
gctgggcctt 
cttattacac 
cctttaagaa 
tgcttacccc 
cccagtgtaa 
gctaccaggg 
acttccagcc 
tgggcatcct 
gcgaaggaca 
acagcattac 
gtaactttat 
ccgcccacgc 
atgttttgtt 
aaaccgtgta 
aacatcaaca 
agatcttggt 
ttctccacac 
acactggatg 
tggcttttct 
ccgtagcgcc 
cgtacagggg 
tgccaactgg 
acccaactcc 
gctctacagc 
gagcgccact 
caataaaggc 
cgtctgcgcc 
ggacacgttg 
cagctcggtg 
gggcgccgat 
cacagggttg 
cttgtcggag 
ctttggtagc 
tagtggcatc 
agccttgatc 
agacttgccg 
ggtgttggag 
agactgctcc 
ctccttattt 
gcggtgcagc 
cgactgcagg 
gaaggtcagc 
agcttccact 
cttgtccatc 
actcagcggg 
ttgcgtccgc 
acctcctttg 
cacatcttct 
cttgggagaa 
cgatggccgc 
ctcggactcg 
cggggacggg 
ctcgggggtg 
gaaaaagatc 
caccaccgcc 
gcttgaggag 
ggaccgctca 
ggaacaagtc 
gctgttgaag 
cgatgtgccc 
gcgcgtaccc 
ctaccccgta 
caagataccc 



20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
22320 
22380 
22440 
22500 
22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
237 60 
23820 
23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24 600 
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ctstcctgcc gtgccaaccg cagccgagcg 
gtcatacctg atatcgcctc gctcaacgaa 
gacgagaagc gcgcggcaaa cgctctgcaa 
ggagtgttgg tggaactcga gggtgacaac 
gaggtcaccc actttgccta cccggcactt 
atgagtgagc tgatcgtgcg ccgtgcgcag 
caaacagagg agggcctacc cgcagttggc 
cgcgagcctg ccgacttgga ggagcgacgc 
gtggagcttg agtgcatgca gcggttcttt 
gaaacattgc actacacctt tcgacagggc 
gtggagctct gcaacctggt ctcctacctt 
aacgtgcttc attccacgct caagggcgag 
tacttatttc tatgctacac ctggcagacg 
gagtgcaacc tcaaggagct gcagaaactg 
gccttcaacg agcgctccgt ggccgcgcac 
cttaaaaccc tgcaacaggg tctgccagac 
aggaacttta tcctagagcg ctcaggaatc 
gactttgtgc ccattaagta ccgcgaatgc 
ctgcagctag ccaactacct tgcctaccac 
ggtctactgg agtgtcactg tcgctgcaac 
aattcgcagc tgcttaacga aagtcaaatt 
cctgacgaaa agtccgcggc tccggggttg 
taccttcgca aatttgtacc tgaggactac 
caatcccgcc cgccaaatgc ggagcttacc 
ggccaattgc aagccatcaa caaagcccgc 
gtttacttgg acccccagtc cggcgaggag 
tatcagcagc agccgcgggc ccttgcttcc 
gccgccgcca cccacggacg aggaggaata 
cgaggaggag gaggacatga tggaagactg 
cgaagaggtg tcagacgaaa caccgtcacc 
gaaatcggca accggttcca gcatggctac 
gcccgttcgc cgacccaacc gtagatggga 
gcagccgccg ccgttagccc aagagcaaca 
gcacaagaac gccatagttg cttgcttgca 
ccgctttctt ctctaccatc acggcgtggc 
tcatctctac agcccatact gcaccggcgg 
cacagaagca aaggcgaccg gatagcaaga 
cggcagcagc aggaggagga gcgctgcgtc 
agcttagaaa caggattttt cccactctgt 
aacaagagct gaaaataaaa aacaggtctc 
acaaaagcga agatcagctt cggcgcacgc 
actgcgcgct gactcttaag gactagtttc 
tacgtcatct ccagcggcca cacccggcgc 
aggaaattcc cacgccctac atgtggagtt 
ctgcccaaga ctactcaacc cgaataaact 
gggtcaacgg aatccgcgcc caccgaaacc 
ccacacctcg taataacctt aatccccgta 
gtcccgctcc caccactgtg gtacttccca 
actcaggggc gcagcttgcg ggcggctttc 
taactcacct gacaatcaga gggcgaggta 
cgcttggtct ccgtccggac gggacatttc 
cgcctcgtca ggcaatccta actctgcaga 
ttggaactct gcaatttatt gaggagtttg 
gacctcccgg ccactatccg gatcaattta 
cggacggcta cgactgataa ttaagtggag 
tccactgtcg ccgccacaag tgctttgccc 
tgcccgagga tcatatcgag gatctttgtt 
gaaattaaaa tatactgggg ctcctatcgc 
ccaagcaaac caaggcgaac cttacctggt 
aacagtttca acccagacgg agtgagtcta 
atcagaaaaa acaccaccct ccttacctgc 
ttcctggatg tcagcatctg actttggcca 
actacagcga cccaccctaa cagagatgac 
acttacatct accacaaata caccccaagt 
gggcatgtgg tggttctcca tagcgcttat 
ctgctgccta aagcgcaaac gcgcccgacc 



gacaagcagc tggccttgcg gcagggcgct 24 660 
gtgccaaaaa tctttgaggg tcttggacgc 24720 
caggaaaaca gcgaaaatga aagtcactct 24780 
gcgcgcctag ccgtactaaa acgcagcatc 24840 
aacctacccc ccaaggtcat gagcacagtc 24 900 
cccctggaga gggatgcaaa tttgcaagaa 24960 
gacgagcagc tagcgcgctg gcttcaaacg 25020 
aaactaatga tggccgcagt gctcgttacc 25080 
gctgacccgg agatgcagcg caagctagag 25140 
tacgtacgcc aggcctgcaa gatctccaac 25200 
ggaattttgc acgaaaaccg ccttgggcaa 25260 
gcgcgccgcg actacgtccg cgactgcgtt 25320 
gccatgggcg tttggcagca gtgcttggag 25380 
ctaaagcaaa acttgaagga cctatggacg 254 40 
ctggcggaca tcattttccc cgaacgcctg 25500 
ttcaccagtc aaagcatgtt gcagaacttt 25560 
ttgcccgcca cctgctgtgc acttcctagc 25620 
cctccgccgc tttggggcca ctgctacctt 25680 
tctgacataa tggaagacgt gagcggtgac 25740 
ctatgcaccc cgcaccgctc cctggtttgc 25800 
atcggtacct ttgagctgca gggtccctcg 25860 
aaactcactc cggggctgtg gacgtcggct 25920 
cacgcccacg agattaggtt ctacgaagac 25980 
gcctgcgtca ttacccaggg ccacattctt 26040 
caagagtttc tgctacgaaa gggacggggg 26100 
ctcaacccaa tccccccgcc gccgcagccc 26160 
caggatggca cccaaaaaga agctgcagct 26220 
ctgggacagt caggcagagg aggttttgga 2 6280 
ggagagccta gacgaggaag cttccgaggt 2 6340 
ctcggtcgca ttcccctcgc cggcgcccca 264 00 
aacctccgct cctcaggcgc cgccggcact 264 60 
caccactgga accagggccg gtaagtccaa 26520 
acagcgccaa ggctaccgct catggcgcgg 26580 
agactgtggg ggcaacatct ccttcgcccg 26640 
cttcccccgt aacatcctgc attactaccg 26700 
cagcggcagc ggcagcaaca gcagcggcca 267 60 
ctctgacaaa gcccaagaaa tccacagcgg 26820 
tggcgcccaa cgaacccgta tcgacccgcg 26880 
atgctatatt tcaacagagc aggggccaag 26940 
tgcgatccct cacccgcagc tgcctgtatc 27000 
tggaagacgc ggaggctctc ttcagtaaat 27060 
gcgccctttc tcaaatttaa gcgcgaaaac 27120 
cagcacctgt cgtcagcgcc attatgagca 27180 
accagccaca aatgggactt gcggctggag 27240 
acatgagcgc gggaccccac atgatatccc 27300 
gaattctctt ggaacaggcg* gctattacca 27360 
gttggcccgc tgccctggtg taccaggaaa 27420 
gagacgccca ggccgaagtt cagatgacta 27480 
gtcacagggt gcggtcgccc gggcagggta 27540 
ttcagctcaa cgacgagtcg gtgagctcct 27600 
agatcggcgg cgccggccgt ccttcattca 27 660 
cctcgtcctc tgagccgcgc tctggaggca 27720 
tgccatcggt ctactttaac cccttctcgg 27780 
ttcctaactt tgacgcggta aaggactcgg 27840 
aggcagagca actgcgcctg aaacacctgg 27900 
gcgactccgg tgagttttgc tactttgaat 27960 
gccatctctg tgctgagtat aataaataca 28020 
catcctgtaa acgccaccgt cttcacccgc 28080 
acttttaaca tctctccctc tgtgatttac 28140 
cgagagaacc tctccgagct cagctactcc 28200 
cgggaacgta cccttaatta aaagtcaggc 282 60 
gcacctgtcc cgcggatttg ttccagtcca 28320 
caacacaacc aacgcggccg ccgctaccgg 28380 
ttctgccttt gtcaataact gggataactt 28440 
gtttgtatgc cttattatta tgtggctcat 28500 
acccatctat agtcccatca ttgtgctaca 28560 
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cccaaacaat gatggaatcc atagattgga 
agtatgatta aatgagatta attaaggaat 
tgccctcctc ccagctctgg tattgcagct 
taaatggaat gtcagtttcc tcctgttcct 
tgcagatgaa gcgcgcaaga ccgtctgaag 
cggaaaccgg tcctccaact gtgccttttc 
ttcaagagag tccccctggg gtactctctt 
atggcatgct tgcgctcaaa atgggcaacg 
cctcccaaaa tgtaaccact gtgagcccac 
tggaaatatc tgcacccctc acagttacct 
ctctaatggt cgcgggcaac acactcacca 
actccaaact tagcattgcc acccaaggac 
tgcaaacatc aggccccctc accaccaccg 
cccctctaac tactgccact ggtagcttgg 
aaaatggaaa actaggacta aagtacgggg 
ctttgaccgt agcaactggt ccaggtgtga 
ttactggagc cttgggtttt gattcacaag 
taaggattga ttctcaaaac agacgcctta 
aaaaccaact aaatctaaga ctaggacagg 
tggatattaa ctacaacaaa ggcctttact 
ttgaggttaa cctaagcact gccaaggggt 
atgcaggaga tgggcttgaa tttggttcac 
caaaaattgg ccatggccta gaatttgatt 
ctggccttag ttttgacagc acaggtgcca 
taactttgtg gaccacacca gctccatctc 
ctaaactcac tttggtctta acaaaatgtg 
tggctgttaa aggcagtttg gctccaatat 
taagatttga cgaaaatgga gtgctactaa 
actttagaaa tggagatctt actgaaggca 
ctaacctatc agcttatcca aaatctcacg 
aagtttactt aaacggagac aaaactaaac 
cacaggaaac aggagacaca actccaagtg 
ctggccacaa ctacattaat gaaatatttg 
cccaagaata aagaatcgtt tgtgttatgt 
atttcaagtc atttttcatt cagtagtata 
ccgtacctta atcaaactca cagaacccta 
acagagtaca cagtcctttc tccccggctg 
gacatattct taggtgttat attccacacg 
atattaataa actccccggg cagctcactt 
acaggctgct gtccaacttg cggttgctta 
atgggggtag agtcataatc gtgcatcagg 
ataaactgct gccgccgccg ctccgtcctg 
gcgatgattc gcaccgcccg cagcataagg 
ctgatctcac ttaaatcagc acagtaactg 
ccacagtgca aggcgctgta tccaaagctc 
tcataccaca agcgcaggta gattaagtgg 
attacctctt ttggcatgtt gtaattcacc 
aacatggcgc catccaccac catcctaaac 
cactgcaggg aaccgggact ggaacaatga 
atcatcatgc tcgtcatgat atcaatgttg 
ctcaggatta caagctcctc ccgcgttaga 
atcagcgtaa atcccacact gcagggaaga 
aaagtgttac attcgggcag cagcggatga 
tcaaaaggag gtagacgatc cctactgtac 
ggtcgtagtg tcatgccaaa tggaacgccg 
ggtgcgggcg tgacaaacag atctgcgtct 
gtagttgtag tatatccact ctctcaaagc 
gtaaactcct tcatgcgccg ctgccctgat 
cagccaacct acacattcgt tctgcgagtc 
aaccatgttt ttttttttat tccaaaagat 
agtgaacgcg ctcccctccg gtggcgtggt 
catttgtaag atgttgcaca atggcttcca 
cgtaaaggct aaacccttca gggtgaatct 
tgcccaaata attctcatct cgccaccttc 
taagtccggc cattgtaaaa atctgctcca 
gaatcatgat tgcaaaaatt caggttcctc 



cggactgaaa cacatgttct tttctcttac 28620 
ttctgtccag tttattcagc agcacctcct 28680 
tcctcctggc tgcaaacttt ctccacaatc 28740 
gtccatccgc acccactatc ttcatgttgt 28800 
ataccttcaa ccccgtgtat ccatatgaca 28860 
ttactcctcc ctttgtatcc cccaatgggt 28920 
tgcgcctatc cgaacctcta gttacctcca 28980 
gcctctctct ggacgaggcc ggcaacctta 29040 
ctctcaaaaa aaccaagtca aacataaacc 29100 
cagaagccct aactgtggct gccgccgcac 29160 
tgcaatcaca ggccccgcta accgtgcacg 29220 
ccctcacagt gtcagaagga aagctagccc 29280 
atagcagtac ccttactatc actgcctcac 29340 
gcattgactt gaaagagccc atttatacac 29400 
ctcctttgca tgtaacagac gacctaaaca 29460 
ctattaataa tacttccttg caaactaaag 29520 
gcaatatgca acttaatgta gcaggaggac 29580 
tacttgatgt tagttatccg tttgatgctc 29640 
gccctctttt tataaactca gcccacaact 29700 
tgtttacagc ttcaaacaat tccaaaaagc 29760 
tgatgtttga cgctacagcc atagccatta 29820 
ctaatgcacc aaacacaaat cccctcaaaa 29880 
caaacaaggc tatggttcct aaactaggaa 29940 
ttacagtagg aaacaaaaat aatgataagc 30000 
ctaactgtag actaaatgca gagaaagatg 30060 
gcagtcaaat acttgctaca gtttcagttt 30120 
ctggaacagt tcaaagtgct catcttatta 30180 
acaattcctt cctggaccca gaatattgga 30240 
cagcctatac aaacgctgtt ggatttatgc 30300 
gtaaaactgc caaaagtaac attgtcagtc 30360 
ctgtaacact aaccattaca ctaaacggta 30420 
catactctat gtcattttca tgggactggt 30480 
ccacatcctc ttacactttt tcatacattg 30540 
ttcaacgtgt ttatttttca attgcagaaa 30600 
gccccaccac cacatagctt atacagatca 30660 
gtattcaacc tgccacctcc ctcccaacac 30720 
gccttaaaaa gcatcatatc atgggtaaca 30780 
gtttcctgtc gagccaaacg ctcatcagtg 30840 
aagttcatgt cgctgtccag ctgctgagcc 30900 
acgggcggcg aaggagaagt ccacgcctac 30960 
atagggcggt ggtgctgcag cagcgcgcga 31020 
caggaataca acatggcagt ggtctcctca 31080 
cgccttgtcc tccgggcaca gcagcgcacc 31140 
cagcacagca ccacaatatt gttcaaaatc 31200 
atggcgggga ccacagaacc cacgtggcca 31260 
cgacccctca taaacacgct ggacataaac 31320 
acctcccggt accatataaa cctctgatta 31380 
cagctggcca aaacctgccc gccggctata 314 40 
cagtggagag cccaggactc gtaaccatgg 31500 
gcacaacaca ggcacacgtg catacacttc 31560 
accatatccc agggaacaac ccattcctga 31620 
cctcgcacgt aactcacgtt gtgcattgtc 31680 
tcctccagta tggtagcgcg ggtttctgtc 31740 
ggagtgcgcc gagacaaccg agatcgtgtt 31800 
gacgtagtca tatttcctga agcaaaacca 31860 
ccggtctcgc cgcttagatc gctctgtgta 31920 
atccaggcgc cccctggctt cgggttctat 31980 
aacatccacc accgcagaat aagccacacc 32040 
acacacggga ggagcgggaa gagctggaag 32100 
tatccaaaac ctcaaaatga agatctatta 32160 
caaactctac agccaaagaa cagataatgg 32220 
aaaggcaaac ggccctcacg tccaagtgga 32280 
cctctataaa cattccagca ccttcaacca 32340 
tcaatatatc tctaagcaaa tcccgaatat 32400 
gagcgccctc caccttcagc ctcaagcagc 324 60 
acagacctgt ataagattca aaagcggaac 32520 
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attaacaaaa ataccgcgat cccgtaggtc ccttcgcagg gccagctgaa cataatcgtg 32580 

caggtctgca cggaccagcg cggccacttc cccgccagga accttgacaa aagaacccac 32640 

actgattatg acacgcatac tcggagctat gctaaccagc gtagccccga tgtaagcttt 32700 

gttgcatggg cggcgatata aaatgcaagg tgctgctcaa aaaatcaggc aaagcctcgc 32760 

gcaaaaaaga aagcacatcg tagtcatgct catgcagata aaggcaggta agctccggaa 32820 

ccaccacaga aaaagacacc atttttctct caaacatgtc tgcgggtttc tgcataaaca 32880 

caaaataaaa taacaaaaaa acatttaaac attagaagcc tgtcttacaa caggaaaaac 32940 

aacccttata agcataagac ggactacggc catgccggcg tgaccgtaaa aaaactggtc 33000 

accgtgatta aaaagcacca ccgacagctc ctcggtcatg tccggagtca taatgtaaga 33060 

ctcggtaaac acatcaggtt gattcatcgg tcagtgctaa aaagcgaccg aaatagcccg 33120 

ggggaataca tacccgcagg cgtagagaca acattacagc ccccatagga ggtataacaa 33180 

aattaatagg agagaaaaac acataaacac ctgaaaaacc ctcctgccta ggcaaaatag 33240 

caccctcccg ctccagaaca acatacagcg cttcacagcg gcagcctaac agtcagcctt 33300 

accagtaaaa aagaaaacct attaaaaaaa caccactcga cacggcacca gctcaatcag 33360 

tcacagtgta aaaaagggcc aagtgcagag cgagtatata taggactaaa aaatgacgta 33420 

acggttaaag tccacaaaaa acacccagaa aaccgcacgc gaacctacgc ccagaaacga 33480 

aagccaaaaa acccacaact tcctcaaatc gtcacttccg ttttcccacg ttacgtaact 33540 

tcccatttta agaaaactac aattcccaac acatacaagt tactccgccc taaaacctac 33600 

gtcacccgcc ccgttcccac gccccgcgcc acgtcacaaa ctccaccccc tcattatcat 33660 

attggcttca atccaaaata aggtatatta ttgatgatg 33699 

<210> 4 
<211> 34448 
<212> DNA 

<213> Adenovirus subgroup C 
<400> 4 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60 
ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 
gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 
taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 
agtgaaatct gaataatttt gtgttactca tagcgcgtaa tatttgtcta gggccgcggg 360 
gactttgacc gtttacgtgg agactcgccc aggtgttttt ctcaggtgtt ttccgcgttc 420 
cgggtcaaag ttggcgtttt attattatag tcagctgacg tgtagtgtat ttatacccgg 480 
tgagttcctc aagaggccac tcttgagtgc cagcgagtag agttttctcc tccgagccgc 540 
tccgacaccg ggactgaaaa tgagacatat tatctgccac ggaggtgtta ttaccgaaga 600 
aatggccgcc agtcttttgg accagctgat cgaagaggta ctggctgata atcttccacc 660 
tcctagccat tttgaaccac ctacccttca cgaactgtat gatttagacg tgacggcccc 720 
cgaagatccc aacgaggagg cggtttcgca gatttttccc gactctgtaa tgttggcggt 780 
gcaggaaggg attgacttac tcacttttcc gccggcgccc ggttctccgg agccgcctca 840 
cctttcccgg cagcccgagc agccggagca gagagccttg ggtccggttt ctatgccaaa 900 
ccttgtaccg gaggtgatcg atcttacctg ccacgaggct ggctttccac ccagtgacga 960 
cgaggatgaa gagggtgagg agtttgtgtt agattatgtg gagcaccccg ggcacggttg 1020 
caggtcttgt cattatcacc ggaggaatac gggggaccca gatattatgt gttcgctttg 1080 
ctatatgagg acctgtggca tgtttgtcta cagtaagtga aaattatggg cagtgggtga 1140 
tagagtggtg ggtttggtgt ggtaattttt tttttaattt ttacagtttt gtggtttaaa 1200 
gaattttgta ttgtgatttt tttaaaaggt cctgtgtctg aacctgagcc tgagcccgag 1260 
ccagaaccgg agcctgcaag acctacccgc cgtcctaaaa tggcgcctgc tatcctgaga 1320 
cgcccgacat cacctgtgtc tagagaatgc aatagtagta cggatagctg tgactccggt 1380 
ccttctaaca cacctcctga gatacacccg gtggtcccgc tgtgccccat taaaccagtt 1440 
gccgtgagag ttggtgggcg tcgccaggct gtggaatgta tcgaggactt gcttaacgag 1500 
cctgggcaac ctttggactt gagctgtaaa cgccccaggc cataaggtgt aaacctgtga 1560 
ttgcgtgtgt ggttaacgcc tttgtttgct gaatgagttg atgtaagttt aataaagggt 1620 
gagataatgt ttaacttgca tggcgtgtta aatggggcgg ggcttaaagg gtatataatg 1680 
cgccgtgggc taatcttggt tacatctgac ctcatggagg cttgggagtg tttggaagat 1740 
ttttctgctg tgcgtaactt gctggaacag agctctaaca gtacctcttg gttttggagg 1800 
tttctgtggg gctcatccca ggcaaagtta gtctgcagaa ttaaggagga ttacaagtgg 1860 
gaatttgaag agcttttgaa atcctgtggt gagctgtttg attctttgaa tctgggtcac 1920 
caggcgcttt tccaagagaa ggtcatcaag actttggatt tttccacacc ggggcgcgct 1980 
gcggctgctg ttgctttttt gagttttata aaggataaat ggagcgaaga aacccatctg 2040 
agcggggggt acctgctgga ttttctggcc atgcatctgt ggagagcggt tgtgagacac 2100 
aagaatcgcc tgctactgtt gtcttccgtc cgcccggcga taataccgac ggaggagcag 2160 
cagcagcagc aggaggaagc caggcggcgg cggcaggagc agagcccatg gaacccgaga 2220 
gccggcctgg accctcggga atgaatgttg tacaggtggc tgaactgtat ccagaactga 2280 
gacgcatttt gacaattaca gaggatgggc aggggctaaa gggggtaaag agggagcggg 2340 
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gggcttgtga ggctacagag gaggctagga 
gtcctgagtg tattactttt caacagatca 
tggcgcagaa gtattccata gagcagctga 
ttgaggaggc tattagggta tatgcaaagg 
tcagcaaact tgtaaatatc aggaattgtt 
agatagatac ggaggatagg gtggccttta 
tgcttggcat ggacggggtg gttattatga 
gtacggtttt cctggccaat eccaacctta 
acaatacctg tgtggaagcc tggaccgatg 
gctggaaggg ggtggtgtgt cgccccaaaa 
aaaggtgtac cttgggtatc ctgtctgagg 
ccgactgtgg ttgcttcatg ctagtgaaaa 
gtggcaactg cgaggacagg gcctctcaga 
tgctgaagac cattcacgta gccagccact 
acatactgac ccgctgttcc ttgcatttgg 
aatgcaattt gagtcacact aagatattgc 
tgaacggggt gtttgacatg accatgaaga 
gcaccaggtg cagaccctgc gagtgtggcg 
tggatgtgac cgaggagctg aggcccgatc 
ttggctctag cgatgaagat acagattgag 
tgggaaagaa tatataaggt gggggtctta 
ccgccgccat gagcaccaac tcgtttgatg 
gcatgccccc atgggccggg gtgcgtcaga 
ccgtcctgcc cgcaaactct actaccttga 
agactgcagc ctccgccgcc gcttcagccg 
actttgcttt cctgagcccg cttgcaagca 
acaagttgac ggctcttttg gcacaattgg 
ctcagcagct gttggatctg cgccagcagg 
atgcggttta aaacataaat aaaaaaccag 
cttgctgtct ttatttaggg gttttgcgcg 
cgttgagggt cctgtgtatt ttttccagga 
acatgggcat aagcccgtct ctggggtgga 
gggtggtgtt gtagatgatc cagtcgtagc 
ctttcagtag caagctgatt gccaggggca 
taagctggga tgggtgcata cgtggggata 
tggctatgtt cccagccata tccctccggg 
tgtatccggt gcacttggga aatttgtcat 
tggagacgcc cttgtgacct ccaagatttt 
gcccacgggc ggcggcctgg gcgaagatat 
ccaggatgag atcgtcatag gccattttta 
gtataatggt tccatccggc ccaggggcgt 
ctttgagttc agatgggggg atcatgtcta 
gggtagggga gatcagctgg gaagaaagca 
cggtgggccc gtaaatcaca cctattaccg 
tgccgtcatc cctgagcagg ggggccactt 
ccctgaccaa atccgccaga aggcgctcgc 
caaagttttt caacggtttg agaccgtccg 
gcagttccag gcggtcccac agctcggtca 
ctcctcgttt cgcgggttgg ggcggctttc 
acgggccagg gtcatgtctt tccacgggcg 
ggtgaagggg tgcgctccgg gctgcgcgct 
ggtgctgaag cgctgccggt cttcgccctg 
gtcatagtcc agcccctccg cggcgtggcc 
gccgcacgag gggcagtgca gacttttgag 
ttccggggag taggcatccg cgccgcaggc 
ggtgagctct ggccgttcgg ggtcaaaaac 
cttacctctg gtttccatga gccggtgtcc 
cccgtataca gacttgagag gcctgtcctc 
aaactcggac cactctgaga caaaggctcg 
ggaggggtag cggtcgttgt ccactagggg 
gtcgccctct tcggcatcaa ggaaggtgat 
tgttcctgaa ggggggctat aaaagggggt 
atcgctgtct gcgagggcca gctgttgggg 
ttctgcgcta agattgtcag tttccaaaaa 
ggtgatgcct ttgagggtgg ccgcatccat 
aagcttggtg gcaaacgacc cgtagagggc 



atctagcttt tagcttaaxig accagacacc 2400 
aggataattg cgctaatgag cttgatctgc 24 60 
ccacttactg gctgcagcca ggggatgatt 2520 
tggcacttag gccagattgc aagtacaaga 2580 
gctacatttc tgggaacggg gccgaggtgg 2640 
gatgtagcat gataaatatg tggccggggg 2700 
atgtaaggtt tactggcccc aattttagcg 2760 
tcctacacgg tgtaagcttc tatgggttta 2820 
taagggttcg gggctgtgcc ttttactgct 2880 
gcagggcttc aattaagaaa tgcctctttg 2940 
gtaactccag ggtgcgccac aatgtggcct 3000 
gcgtggctgt gattaagcat aacatggtat 3060 
tgctgacctg ctcggacggc aactgtcacc 3120 
ctcgcaaggc ctggccagtg tttgagcata 3180 
gtaacaggag gggggtgttc ctaccttacc 3240 
ttgagcccga gagcatgtcc aaggtgaacc 3300 
tctggaaggt gctgaggtac gatgagaccc 3360 
gtaaacatat taggaaccag cctgtgatgc 3420 
acttggtgct ggcctgcacc cgcgctgagt 3480 
gtactgaaat gtgtgggcgt ggcttaaggg 354 0 
tgtagttttg tatctgtttt gcagcagccg 3600 
gaagcattgt gagctcatat ttgacaacgc 3660 
atgtgatggg ctccagcatt gatggtcgcc 3720 
cctacgagac cgtgtctgga acgccgttgg 3780 
ctgcagccac cgcccgcggg attgtgactg 384 0 
gtgcagcttc ccgttcatcc gcccgcgatg 3900 
attctttgac ccgggaactt aatgtcgttt 3960 
tttctgccct gaaggcttcc tcccctccca 4020 
actctgtttg gatttggatc aagcaagtgt 4080 
cgcggtaggc ccgggaccag cggtctcggt 4140 
cgtggtaaag gtgactctgg atgttcagat 4200 
ggtagcacca ctgcagagct tcatgctgcg 4260 
aggagcgctg ggcgtggtgc ctaaaaatgt 4320 
ggcccttggt gtaagtgttt acaaagcggt 4380 
tgagatgcat cttggactgt atttttaggt 4440 
gattcatgtt gtgcagaacc accagcacag 4500 
gtagcttaga aggaaatgcg tggaagaact 4560 
ccatgcattc gtccataatg atggcaatgg 4620 
ttctgggatc actaacgtca tagttgtgtt 4680 
caaagcgcgg gcggagggtg ccagactgcg 4740 
agttaccctc acagatttgc atttcccacg 4800 
cctgcggggc gatgaagaaa acggtttccg 4860 
ggttcctgag cagctgcgac ttaccgcagc 4920 
ggtgcaactg gtagttaaga gagctgcagc 4980 
cgttaagcat gtccctgact cgcatgtttt 5040 
cgcccagcga tagcagttct tgcaaggaag 5100 
ccgtaggcat gcttttgagc gtttgaccaa 5160 
cctgctctac ggcatctcga tccagcatat 5220 
gctgtacggc agtagtcggt gctcgtccag 5280 
cagggtcctc gtcagcgtag tctgggtcac 5340 
ggccagggtg cgcttgaggc tggtcctgct 5400 
cgcgtcggcc aggtagcatt tgaccatggt 54 60 
cttggcgcgc agcttgccct tggaggaggc 5520 
ggcgtagagc ttgggcgcga gaaataccga 5580 
cccgcagacg gtctcgcatt ccacgagcca 5640 
caggtttccc ccatgctttt tgatgcgttt 5700 
acgctcggtg acgaaaaggc tgtccgtgtc 5760 
gagcggtgtt ccgcggtcct cctcgtatag 5820 
cgtccaggcc agcacgaagg aggctaagtg 5880 
gtccacrcgc tccagggtgt gaagacacat 5940 
tggtttgtag gtgtaggcca cgtgaccggg 6000 
gggggcgcgt tcgtcctcac tctcttccgc 6060 
tgagtactcc ctctgaaaag cgggcatgac 6120 
cgaggaggat ttgatattca cctggcccgc 6180 
ctggtcagaa aagacaatct ttttgttgtc 6240 
gttggacagc aacttggcga tggagcgcag 6300 
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ggtttggttt trgtcgcgat cggcgcgctc cttggccgcg atgtttagct gcacgtattc 6360 
gcgcgcaacg caccgccatt cgggaaagac ggtggtgcgc tcgtcgggca ccaggtgcac 6420 
gcgccaaccg cggttgtgca gggtgacaag gtcaacgcrg gtggctacct ctccgcgtag 6480 
gcgctcgttg gtccagcaga ggcggccgcc cttgcgcgag cagaatggcg gtagggggtc 6540 
tagctgcgtc tcgtccgggg ggtctgcgtc cacggtaaag accccgggca gcaggcgcgc 6600 
gtcgaagtag tctatcttgc atccttgcaa gtctagcgcc tgctgccatg cgcgggcggc 6660 
aagcgcgcgc tcgtatgggt tgagtggggg accccatggc atggggtggg tgagcgcgga 6720 
ggcgtacatg ccgcaaatgt cgtaaacgta gaggggctct ctgagtattc caagatatgt 6780 
agggtagcat cttccaccgc ggatgctggc gcgcacgtaa tcgtatagtt cgtgcgaggg 6840 
agcgaggagg tcgggaccga ggttgctacg ggcgggctgc tccgctcgga agactatctg 6900 
cctgaagatg gcatgtgagt tggatgatat ggttggacgc tggaagacgt tgaagctggc 6960 
gtctgtgaga cctaccgcgt cacgcacgaa ggaggcgtag gagtcgcgca gcttgttgac 7020 
cagctcggcg gtgacctgca cgtctagggc gcagtagtcc agggtttcct tgatgatgtc 7080 
atacttatcc tgtccctttt ttttccacag ctcgcggttg aggacaaact cttcgcggtc 7140 
tttccagtac tcttggatcg gaaacccgtc ggcctccgaa cggtaagagc ctagcatgta 7200 
gaactggttg acggcctggt aggcgcagca tcccttttct acgggtagcg cgtatgcctg 7260 
cgcggccttc cggagcgagg tgtgggtgag cgcaaaggtg tccctgacca tgactttgag 7320 
gtactggtat ttgaagtcag tgtcgtcgca tccgccctgc tcccagagca aaaagtccgt 7380 
gcgctttttg gaacgcggat ttggcagggc gaaggtgaca tcgttgaaga gtatctttcc 7440 
cgcgcgaggc ataaagttgc gtgtgatgcg gaagggtccc ggcacctcgg aacggttgtt 7500 
aattacctgg gcggcgagca cgatctcgtc aaagccgttg atgttgtggc ccacaatgta 7560 
aagttccaag aagcgcggga tgcccttgat ggaaggcaat tttttaagtt cctcgtaggt 7620 
gagctcttca ggggagctga gcccgtgctc tgaaagggcc cagtctgcaa gatgagggtt 7680 
ggaagcgacg aatgagctcc acaggtcacg ggccattagc atttgcaggt ggtcgcgaaa 7740 
ggtcctaaac tggcgaccta tggccatttt ttctggggtg atgcagtaga aggtaagcgg 7800 
gtcttgttcc cagcggtccc atccaaggtt cgcggctagg tctcgcgcgg cagtcactag 7860 
aggctcatct ccgccgaact tcatgaccag catgaagggc acgagctgct tcccaaaggc 7920 
ccccatccaa gtataggtct ctacatcgta ggtgacaaag agacgctcgg tgcgaggatg 7980 
cgagccgatc gggaagaact ggatctcccg ccaccaattg gaggagtggc tattgatgtg 8040 
gtgaaagtag aagtccctgc gacgggccga acactcgtgc tggcttttgt aaaaacgtgc 8100 
gcagtactgg cagcggtgca cgggctgtac atcctgcacg aggttgacct gacgaccgcg 8160 
cacaaggaag cagagtggga atttgagccc ctcgcctggc gggtttggct ggtggtcttc 8220 
tacttcggct gcttgtcctt gaccgtctgg ctgctcgagg ggagttacgg tggatcggac 8280 
caccacgccg cgcgagccca aagtccagat gtccgcgcgc ggcggtcgga gcttgatgac 834 0 
aacatcgcgc agatgggagc tgtccatggt ctggagctcc cgcggcgtca ggtcaggcgg 8400 
gagctcctgc aggtttacct cgcatagacg ggtcagggcg cgggctagat ccaggtgata 84 60 
cctaatttcc aggggctggt tggtggcggc gtcgatggct tgcaagaggc cgcatccccg 8520 
cggcgcgact acggtaccgc gcggcgggcg gtgggccgcg ggggtgtcct tggatgatgc 8580 
atctaaaagc ggtgacgcgg gcgagccccc ggaggtaggg ggggctccgg acccgccggg 8640 
agagggggca ggggcacgtc ggcgccgcgc gcgggcagga gctggtgctg cgcgcgtagg 8700 
ttgctggcga acgcgacgac gcggcggttg atctcctgaa tctggcgcct ctgcgtgaag 8760 
acgacgggcc cggtgagctt gagcctgaaa gagagttcga cagaatcaat ttcggtgtcg 8820 
ttgacggcgg cctggcgcaa aatctcctgc acgtctcctg agttgtcttg ataggcgatc 8880 
tcggccatga actgctcgat ctcttcctcc tggagatctc cgcgtccggc tcgctccacg 8940 
gtggcggcga ggtcgttgga aatgcgggcc atgagctgcg agaaggcgtt gaggcctccc 9000 
tcgttccaga cgcggctgta gaccacgccc ccttcggcat cgcgggcgcg catgaccacc 9060 
tgcgcgagat tgagctccac gtgccgggcg aagacggcgt agtttcgcag gcgctgaaag 9120 
aggtagttga gggtggtggc ggtgtgttct gccacgaaga agtacataac ccagcgtcgc 9180 
aacgtggatt cgttgatatc ccccaaggcc tcaaggcgct ccatggcctc gtagaagtcc 9240 
acggcgaagt tgaaaaactg ggagttgcgc gccgacacgg ttaactcctc ctccagaaga 9300 
cggatgagct cggcgacagt gtcgcgcacc tcgcgctcaa aggctacagg ggcctcttct 9360 
tcttcttcaa tctcctcttc cataagggcc tccccttctt cttcttctgg cggcggtggg 9420 
ggagggggga cacggcggcg acgacggcgc accgggaggc ggtcgacaaa gcgctcgatc 9480 
atctccccgc ggcgacggcg catggtctcg gtgacggcgc ggccgttctc gcgggggcgc 954 0 
agttggaaga cgccgcccgt catgtcccgg ttatgggttg gcggggggct gccatgcggc 9600 
agggatacgg cgctaacgat gcatctcaac aattgttgtg taggtactcc gccgccgagg 9660 
gacctgagcg agtccgcatc gaccggatcg gaaaacctct cgagaaaggc gtctaaccag 9720 
tcacagtcgc aaggtaggct gagcaccgtg gcgggcggca gcgggcggcg gtcggggttg 9780 
tttctggcgg aggtgctgct gatgatgtaa ttaaagtagg cggtcttgag acggcggatg 9840 
gtcgacagaa gcaccatgtc cttgggtccg gcctgctgaa tgcgcaggcg gtcggccatg 9900 
ccccaggctt cgttttgaca tcggcgcagg tctttgtagt agtcttgcat gagcctttct 9960 
accggcactt cttcttctcc ttcctcttgt cctgcatctc ttgcatctat cgctgcggcg 10020 
gcggcggagt ttggccgtag gtggcgccct cttcctccca tgcgtgtgac cccgaagccc 10080 
ctcatcggct gaagcagggc taggtcggcg acaacgcgct cggctaatat ggcctgctgc 1014 0 
acctgcgtga gggtagactg gaagtcatcc atgtccacaa agcggtggta tgcgcccgtg 10200 
ttgatggtgt aagtgcagtt ggccataacg gaccagttaa cggtctggtg acccggctgc 10260 
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gagagctcgg tgtacctgag acgcgagtaa 
gtccgcacca ggtactggta tcccaccaaa 
cagcgtaggg tggccggggc tccgggggcg 
tagatgtacc tggacatcca ggtgatgccg 
cggacgcggt tccagatgtt gcgcagcggc 
ccggtcaggc gcgcgcaatc gttgacgctc 
ggcactcttc cgtggtctgg tggataaatt 
tcgagccccg tatccggccg tccgccgtga 
caggtgtgcg acgtcagaca acgggggagt 
gctgctgcgc tagctttttt ggccactggc 
gcgaaagcat taagtggctc gctccctgta 
gcgggacccc cggttcgagt ctcggaccgg 
ccgtcatgca agaccccgct tgcaaattcc 
ttttcccaga tgcatccggt gctgcggcag 
caagagcagc ggcagacatg cagggcaccc 
acatccgcgg ttgacgcggc agcagatggt 
cactacctgg acttggagga gggcgagggc 
cggtacccaa gggtgcagct gaagcgtgat 
ctgtttcgcg accgcgaggg agaggagccc 
gggcgcgagc tgcggcatgg cctgaatcgc 
cccgacgcgc gaaccgggat tagtcccgcg 
accgcatacg agcagacggt gaaccaggag 
gtgcgtacgc ttgtggcgcg cgaggaggtg 
gtaagcgcgc tggagcaaaa cccaaatagc 
gtgcagcaca gcagggacaa cgaggcattc 
gagggccgct ggctgctcga tttgataaac 
agcttgagcc tggctgacaa ggtggccgcc 
ttttacgccc gcaagatata ccatacccct 
gaggggttct acatgcgcat ggcgctgaag 
tatcgcaacg agcgcatcca caaggccgtg 
cgcgagctga tgcacagcct gcaaagggcc 
gccgagtcct actttgacgc gggcgctgac 
gaggcagctg gggccggacc tgggctggcg 
ggcgtggagg aatatgacga ggacgatgag 
gtgatgtttc tgatcagatg atgcaagacg 
agagccagcc gtccggcctt aactccacgg 
tgtcgctgac tgcgcgcaat cctgacgcgt 
ccgcaattct ggaagcggtg gtcccggcgc 
cgatcgtaaa cgcgctggcc gaaaacaggg 
acgacgcgct gcttcagcgc gtggctcgtt 
accggctggt gggggatgtg cgcgaggccg 
gcaacctggg ctccatggtt gcactaaacg 
cgcggggaca ggaggactac accaactttg 
caccgcaaag tgaggtgtac cagtctgggc 
gcctgcagac cgtaaacctg agccaggctt 
gggctcccac aggcgaccgc gcgaccgtgt 
tgctgctgct aatagcgccc ttcacggaca 
gtcacttgct gacactgtac cgcgaggcca 
tccaggagat tacaagtgtc agccgcgcgc 
caaccctaaa ctacctgctg accaaccggc 
acagcgagga ggagcgcatt ttgcgctacg 
gcgacggggt aacgcccagc gtggcgctgg 
tgtatgcctc aaaccggccg tttatcaacc 
ccgtgaaccc cgagtatttc accaatgcca 
gtttctacac cgggggattc gaggtgcccg 
tagacgacag cgtgttttcc ccgcaaccgc 
aggcagaggc ggcgctgcga aaggaaagct 
gcgctgcggc cccgcggtca gatgctagta 
ccagcactcg caccacccgc ccgcgcctgc 
tgctgcagcc gcagcgcgaa aaaaacctgc 
gcctagtgga caagatgagt agatggaaga 
gcccgcgccc gcccacccgt cgtcaaaggc 
acgatgactc ggcagacgac agcagcgtcc 
cgcaccttcg ccccaggctg gggagaatgt 
aaaaactcac caaggccatg gcaccgagcg 
gcgcgcggcg atgtatgagg aaggtcctcc 



gccctcgagt caaatacgta gtcgttgcaa 10320 
aagtgcggcg gcggctggcg gtagaggggc 10380 
agatcttcca acataaggcg atgatatccg 10440 
gcggcggtgg tggaggcgcg cggaaagtcg 10500 
aaaaagtgct ccatggtcgg gacgctctgg 10560 
tagaccgtgc aaaaggagag cctgtaagcg 10620 
cgcaagggta tcatggcgga cgaccggggt 10680 
tccatgcggt taccgcccgc gtgtcgaacc 10740 
gctccttttg gcttccttcc aggcgcggcg 10800 
cgcgcgcagc gtaagcggtt aggctggaaa 10860 
gccggagggt tattttccaa gggttgagtc 10920 
ccggactgcg gcgaacgggg gtttgcctcc 10980 
tccggaaaca gggacgagcc ccttttttgc 11040 
atgcgccccc ctcctcagca gcggcaagag 11100 
tcccctcctc ctaccgcgtc aggaggggcg 11160 
gattacgaac ccccgcggcg ccgggcccgg 11220 
ctggcgcggc taggagcgcc ctctcctgag 11280 
acgcgtgagg cgtacgtgcc gcggcagaac 11340 
gaggagatgc gggatcgaaa gttccacgca 11400 
gagcggttgc tgcgcgagga ggactttgag 114 60 
cgcgcacacg tggcggccgc cgacctggta 11520 
attaactttc aaaaaagctt taacaaccac 11580 
gctataggac tgatgcatct gtgggacttt 11640 
aagccgctca tggcgcagct gttccttata 11700 
agggatgcgc tgctaaacat agtagagccc 11760 
atcctgcaga gcatagtggt gcaggagcgc 11820 
atcaactatt ccatgcttag cctgggcaag 11880 
tacgttccca tagacaagga ggtaaagatc 11940 
gtgcttacct tgagcgacga cctgggcgtt 12000 
agcgtgagcc ggcggcgcga gctcagcgac 12060 
ctggctggca cgggcagcgg cgatagagag 12120 
ctgcgctggg ccccaagccg acgcgccctg 12180 
gtggcacccg cgcgcgctgg caacgtcggc 12240 
tacgagccag aggacggcga gtactaagcg 12300 
caacggaccc ggcggtgcgg gcggcgctgc 12360 
acgactggcg ccaggtcatg gaccgcatca 12420 
tccggcagca gccgcaggcc aaccggctct 12480 
gcgcaaaccc cacgcacgag aaggtgctgg 12540 
ccatccggcc cgacgaggcc ggcctggtct 12600 
acaacagcgg caacgtgcag accaacctgg 12660 
tggcgcagcg tgagcgcgcg cagcagcagg 12720 
ccttcctgag tacacagccc gccaacgtgc 12780 
tgagcgcact gcggctaatg gtgactgaga 12840 
cagactattt tttccagacc agtagacaag 12900 
tcaaaaactt gcaggggctg tggggggtgc 12960 
ctagcttgct gacgcccaac tcgcgcctgt 13020 
gtggcagcgt gtcccgggac acatacctag 13080 
taggtcaggc gcatgtggac gagcatactt 13140 
tggggcagga ggacacgggc agcctggagg 13200 
ggcagaagat cccctcgttg cacagtttaa 13260 
tgcagcagag cgtgagcctt aacctgatgc 13320 
acatgaccgc gcgcaacatg gaaccgggca 13380 
gcctaatgga ctacttgcat cgcgcggccg 13440 
tcttgaaccc gcactggcta ccgccccctg 13500 
agggtaacga tggattcctc tgggacgaca 13560 
agaccctgct agagttgcaa cagcgcgagc 13620 
tccgcaggcc aagcagcttg tccgatctag 13680 
gcccatttcc aagcttgata gggtctctta 13740 
tgggcgagga ggagtaccta aacaactcgc 13800 
ctccggcatt tcccaacaac gggatagaga 13860 
cgtacgcgca ggagcacagg gacgtgccag 13920 
acgaccgtca gcggggtctg gtgtgggagg 13980 
tggatttggg agggagtggc aacccgtttg 14040 
tttaaaaaaa aaaaagcatg atgcaaaata 14100 
ttggttttct tgtattcccc ttagtatgcg 14160 
tccctcctac gagagtgtgg tgagcgcggc 14220 
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gccagtggcg 
tccgcggtac 
cctattcgac 
gaactaccag 
cccgggggag 
cctgaaaacc 
gtttaaggcg 
atacgagtgg 
ccttatgaac 
ggaaagcgac 
cactggtctt 
gctgccagga 
caagcggcaa 
cattcccgca 
gggcgggggt 
cgcggcagcc 
cacctttgcc 
cgcccccgct 
gacagaggac 
gtaccgcagc 
gaccctgctt 
agacatgatg 
ggtgggcgcc 
ctcccaactc 
ccagattttg 
tctcacagat 
cattactgac 
gccgcgcgtc 
caataacaca 
ctccgaccaa 
acgcggccgc 
gcgcaactac 
ggtgcgcgga 
ccaccgccgc 
acgtcgcacc 
cactgtgccc 
tatgactcag 
cgtgcccgtg 
gtactgttgt 
caaagaagag 
gcaggattac 
tgaacttgac 
gaaaggtcga 
tgagcgctcc 
gcttgagcag 
gctggcgttg 
gcaggtgctg 
tgacttggca 
ggaaaaaatg 
ggtggcgccg 
cagtattgcc 
ggcggatgcc 
aacggacccg 
cggcgccgcc 
cggctatcgt 
cactggaacc 
cagggtggct 
catcgtttaa 
tttcccggtg 
cctgacgggc 
gcgcggcggt 
cggaattgca 
gaaaaatcaa 
gaatggaaga 
gaaactggca 
tgtggagcgg 



gcggcgctgg 
ctgcggccta 
accacccgtg 
aacgaccaca 
gcaagcacac 
atcctgcata 
cgggtgatgg 
gtggagttca 
aacgcgatcg 
atcggggtaa 
gtcatgcctg 
tgcggggtgg 
cccttccagg 
ctgttggatg 
ggcgcaggcg 
gcggcaatgc 
acacgggctg 
gcgcaacccg 
agcaagaaac 
tggtaccttg 
tgcactcctg 
caagaccccg 
gagctgttgc 
atccgccagt 
gcgcgcccgc 
cacgggacgc 
gccagacgcc 
ctatcgagcc 
ggctggggcc 
cacccagtgc 
actgggcgca 
acgcccacgc 
gcccggcgct 
cgacccggca 
ggccgacggg 
cccaggtcca 
ggtcgcaggg 
cgcacccgcc 
atgtatccag 
atgctccagg 
aagccccgaa 
gacgaggtgg 
cgcgtaaaac 
acccgcacct 
gccaacgagc 
ccgctggacg 
cccgcgcttg 
cccaccgtgc 
accgtggaac 
ggactgggcg 
accgccacag 
gcggtgcagg 
tggatgtttc 
agcgcgctac 
ggctacacct 
cgccgccgcc 
cgcgaaggag 
aagccggtct 
ccgggattcc 
ggcatgcgtc 
atcctgcccc 
tccgtggcct 
aataaaaagt 
catcaacttt 
agatatcggc 
cattaaaaat 



gttctccctt 
ccggggggag 
tgtacctggt 
gcaactttct 
agaccatcaa 
ccaacatgcc 
tgtcgcgctt 
cgctgcccga 
tggagcacta 
agtttgacac 
gggtatatac 
acttcaccca 
agggctttag 
tggacgccta 
gcagcaacag 
agccggtgga 
aggagaagcg 
aggtcgagaa 
gcagttacaa 
catacaacta 
acgtaacctg 
tgaccttccg 
ccgtgcactc 
ttacctctct 
cagcccccac 
taccgctgcg 
gcacctgccc 
gcactttttg 
tgcgcttccc 
gcgtgcgcgg 
ccaccgtcga 
cgccaccagt 
atgctaaaat 
ctgccgccca 
cggccatgcg 
ggcgacgagc 
gcaacgtgta 
ccccgcgcaa 
cggcggcggc 
tcatcgcgcc 
agctaaagcg 
aactgctgca 
gtgttttgcg 
acaagcgcgt 
gcctcgggga 
agggcaaccc 
caccgtccga 
agctgatggt 
ctgggctgga 
tgcagaccgt 
agggcatgga 
cggtcgctgc 
gcgtttcagc 
tgcccgaata 
accgccccag 
gtcgccgtcg 
gcaggaccct 
ttgtggttct 
gaggaagaat 
gtgcgcacca 
tccttattcc 
tgcaggcgca 
ctggactctc 
gcgtctctgg 
accagcaata 
ttcggttcca 



cgatgctccc 
aaacagcatc 
ggacaacaag 
gaccacggtc 
tcttgacgac 
aaatgtgaac 
gcctactaag 
gggcaactac 
cttgaaagtg 
ccgcaacttc 
aaacgaagcc 
cagccgcctg 
gatcacctac 
ccaggcgagc 
cagtggcagc 
ggacatgaac 
cgctgaggcc 
gcctcagaag 
cctaataagc 
cggcgaccct 
cggctcggag 
ctccacgcgc 
caagagcttc 
gacccacgtg 
catcaccacc 
caacagcatc 
ctacgtttac 
agcaagcatg 
aagcaagatg 
gcactaccgc 
tgacgccatc 
gtccacagtg 
gaagagacgg 
acgcgcggcg 
ggccgctcga 
ggccgccgca 
ttgggtgcgc 
ctagattgca 
gcgcaacgaa 
ggagatctat 
ggtcaaaaag 
cgctaccgcg 
acccggcacc 
gtatgatgag 
gtttgcctac 
aacacctagc 
agaaaagcgc 
acccaagcgc 
gcccgaggtc 
ggacgttcag 
gacacaaacg 
ggccgcgtcc 
cccccggcgc 
tgccctacat 
aagacgagca 
ccagcccgtg 
ggtgctgcca 
tgcagatatg 
gcaccgtagg 
ccggcggcgg 
actgatcgcc 
gagacactga 
acgctcgctt 
ccccgcgaca 
tgagcggtgg 
ccgttaagaa 



ctggacccgc 

cgttactctg 

tcaacggatg 

attcaaaaca 

cggtcgcact 

gagttcatgt 

gacaatcagg 

tccgagacca 

ggcagacaga 

agactggggt 

ttccatccag 

agcaacttgt 

gatgatctgg 

ttgaaagatg 

ggcgcggaag 

gatcatgcca 

gaagcagcgg 

aaaccggtga 

aatgacagca 

cagaccggaa 

caggtctact 

cagatcagca 

tacaacgacc 

ttcaatcgct 

gtcagtgaaa 

ggaggagtcc 

aaggccctgg 

tccatcctta 

tttggcgggg 

gcgccctggg 

gacgcggtgg 

gacgcggcca 

cggaggcgcg 

gcggccctgc 

aggctggccg 

gcagccgcgg 

gactcggtta 

agaaaaaact 

gctatgtcca 

ggccccccga 

aaaaagaaag 

cccaggcgac 

accgtagtct 

gtgtacggcg 

ggaaagcggc 

ctaaagcccg 

ggcctaaagc 

cagcgactgg 

cgcgtgcggc 

atacccacta 

tccccggttg 

aagacctcta 

ccgcgcggtt 

ccttccattg 

actacccgac 

ctggccccga 

acagcgcgct 

gccctcacct 

aggggcatgg 

cgcgcgtcgc 

gcggcgattg 

ttaaaaacaa 

ggtcctgtaa 

cggctcgcgc 

cgccttcagc 

ctatggcagc 



cgtttgtgcc 

agttggcacc 

tggcatccct 

atgactacag 

ggggcggcga 

ttaccaataa 

tggagctgaa 

tgaccataga 

acggggttct 

ttgaccccgt 

acatcatttt 

tgggcatccg 

agggtggtaa 

acaccgaaca 

agaactccaa 

ttcgcggcga 

ccgaagctgc 

tcaaacccct 

ccttcaccca 

tccgctcatg 

ggtcgttgcc 

actttccggt 

aggccgtcta 

ttcccgagaa 

acgttcctgc 

agcgagtgac 

gcatagtctc 

tatcgcccag 

ccaagaagcg 

gcgcgcacaa 

tggaggaggc 

ttcagaccgt 

tagcacgtcg 

ttaaccgcgc 

cgggtattgt 

ccattagtgc 

gcggcctgcg 

acttagactc 

agcgcaaaat 

agaaggaaga 

atgatgatga 

gggtacagtg 

ttacgcccgg 

acgaggacct 

ataaggacat 

taacactgca 

gcgagtctgg 

aagatgtctt 

caatcaagca 

ccagtagcac 

cctcagcggt 

cggaggtgca 

cgaggaagta 

cgcctacccc 

gccgaaccac 

tttccgtgcg 

accaccccag 

gccgcctccg 

ccggccacgg 

accgtcgcat 

gcgccgtgcc 

gttgcatgtg 

ctattttgta 

ccgttcatgg 

tggggctcgc 

aaggcctgga 



14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 

15960 

16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 

16980 

17040 

17100 

17160 

17220 

17280 

17340 

17400 

17460 

17520 

17580 

17640 

17700 

17760 

17820 

17880 

17940 

18000 

18060 

18120 

18180 
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acagcagcac aggccagatg ctgagggata 
tggtagatgg cctggcctct ggcattagcg 
aaaataagat taacagtaag cttgatcccc 
tggagacagt gtctccagag gggcgtggcg 
ctctggtgac gcaaatagac gagcctccct 
ccaccacccg tcccatcgcg cccatggcta 
cgctggacct gcctcccccc gccgacaccc 
ccgttgttgt aacccgtcct agccgcgcgt 
cgttgcggcc cgtagccagt ggcaactggc 
gggtgcaatc cctgaagcgc cgacgatgct 
atgtatgcgt ccatgtcgcc gccagaggag 
gatggctacc ccttcgatga tgccgcagtg 
ctcggagtac ctgagccccg ggctggtgca 
cctgaataac aagtttagaa accccacggt 
gtcccagcgt ttgacgctgc ggttcatccc 
caaggcgcgg ttcaccctag ctgtgggtga 
ctttgacatc cgcggcgtgc tggacagggg 
ctacaacgcc ctggctccca agggtgcccc 
tgctcttgaa ataaacctag aagaagagga 
agctgagcag caaaaaactc acgtatttgg 
aaaggagggt attcaaatag gtgtcgaagg 
tcaacctgaa cctcaaatag gagaatctca 
tgggagagtc cttaaaaaga ctaccccaat 
cacaaatgaa aatggagggc aaggcattct 
tcaagtggaa atgcaatttt tctcaactac 
gactcctaaa gtggtattgt acagtgaaga 
ttcttacatg cccactatta aggaaggtaa 
gcccaacagg cctaattaca ttgcttttag 
cagcacgggt aatatgggtg ttctggcggg 
tttgcaagac agaaacacag agctttcata 
aaccaggtac ttttctatgt ggaatcaggc 
tattgaaaat catggaactg aagatgaact 
gattaataca gagactctta ccaaggtaaa 
aaaagatgct acagaatttt cagataaaaa 
ggaaatcaat ctaaatgcca acctgtggag 
tttgcccgac aagctaaagt acagtccttc 
ctacgactac atgaacaagc gagtggtggc 
tggagcacgc tggtcccttg actatatgga 
tgctggcctg cgctaccgct caatgttgct 
ccaggtgcct cagaagttct ttgccattaa 
ctacgagtgg aacttcagga aggatgttaa 
cctaagggtt gacggagcca gcattaagtt 
ccccatggcc cacaacaccg cctccacgct 
ccagtccttt aacgactatc tctccgccgc 
taccaacgtg cccatatcca tcccctcccg 
cacgcgcctt aagactaagg aaaccccatc 
ctactctggc tctataccct acctagatgg 
ggtggccatt acctttgact cttctgtcag 
caacgagttt gaaattaagc gctcagttga 
catgaccaaa gactggttcc tggtacaaat 
cttctatatc ccagagagct acaaggaccg 
catgagccgt caggtggtgg atgatactaa 
acaccaacac aacaactctg gatttgttgg 
ggcctaccct gctaacttcc cctatccgct 
ccagaaaaag tttctttgcg atcgcaccct 
gtccatgggc gcactcacag acctgggcca 
gctagacatg acttttgagg tggatcccat 
tgaagtcttt gacgtggtcc gtgtgcaccg 
cctgcgcacg cccttctcgg ccggcaacgc 
acagctgccg ccatgggctc cagtgagcag 
tgtgggccat attttttggg cacctatgac 
aagctcgcct gcgccatagt caatacggcc 
gcctttgcct ggaacccgca ctcaaaaaca 
gaccagcgac tcaagcaggt ttaccagttt 
attgcttctt cccccgaccg ctgtataacg 
cccaactcgg ccgcctgtgg actattctgc 



agttgaaaga gcaaaatttc caacaaaagg 18240 
gggtggtgga cctggccaac caggcagtgc 18300 
gccctcccgt agaggagcct ccaccggccg 18360 
aaaagcgtcc gcgccccgac agggaagaaa 18420 
cgtacgagga ggcactaaag caaggcctgc 18480 
ccggagtgct gggccagcac acacccgtaa 18540 
agcagaaacc tgtgctgcca ggcccgaccg 18600 
ccctgcgccg cgccgccagc ggtccgcgat 18660 
aaagcacact gaacagcatc gtgggtctgg 18720 
tctgaatagc taacgtgtcg tatgtgtgtc 18780 
ctgctgagcc gccgcgcgcc cgctttccaa 18840 
gtcttacatg cacatctcgg gccaggacgc 18900 
gtttgcccgc gccaccgaga cgtacttcag 18960 
ggcgcctacg cacgacgtga ccacagaccg 19020 
tgtggaccgt gaggatactg cgtactcgta 19080 
taaccgtgtg ctggacatgg cttccacgta 19140 
ccctactttt aagccctact ctggcactgc 19200 
aaatccttgc gaatgggatg aagctgctac 19260 
cgatgacaac gaagacgaag tagacgagca 19320 
gcaggcgcct tattctggta taaatattac 19380 
tcaaacacct aaatatgccg ataaaacatt 19440 
gtggtacgaa actgaaatta atcatgcagc 19500 
gaaaccatgt tacggttcat atgcaaaacc 19560 
tgtaaagcaa caaaatggaa agctagaaag 19620 
tgaggcgacc gcaggcaatg gtgataactt 19680 
tgtagatata gaaaccccag acactcatat 19740 
ctcacgagaa ctaatgggcc aacaatctat 19800 
ggacaatttt attggtctaa tgtattacaa 19860 
ccaagcatcg cagttgaatg ctgttgtaga 19920 
ccagcttttg cttgattcca ttggtgatag 19980 
tgttgacagc tatgatccag atgttagaat 20040 
tccaaattac tgctttccac tgggaggtgt 20100 
acctaaaaca ggtcaggaaa atggatggga 20160 
tgaaataaga gttggaaata attttgccat 20220 
aaatttcctg tactccaaca tagcgctgta 20280 
caacgtaaaa atttctgata acccaaacac 20340 
tcccgggtta gtggactgct acattaacct 20400 
caacgtcaac ccatttaacc accaccgcaa 204 60 
gggcaatggt cgctatgtgc ccttccacat 20520 
aaacctcctt ctcctgccgg gctcatacac 20580 
catggttctg cagagctccc taggaaatga 20640 
tgatagcatt tgcctttacg ccaccttctt 20700 
tgaggccatg cttagaaacg acaccaacga 20760 
caacatgctc taccctatac ccgccaacgc 20820 
caactgggcg gctttccgcg gctgggcctt 20880 
actgggctcg ggctacgacc cttattacac 20940 
aaccttttac ctcaaccaca cctttaagaa 21000 
ctggcctggc aatgaccgcc tgcttacccc 21060 
cggggagggt tacaacgttg cccagtgtaa 21120 
gctagctaac tacaacattg gctaccaggg 21180 
catgtactcc ttctttagaa acttccagcc 21240 
atacaaggac taccaacagg tgggcatcct 21300 
ctaccttgcc cccaccatgc gcgaaggaca 21360 
tataggcaag accgcagttg acagcattac 21420 
ttggcgcatc ccattctcca gtaactttat 21480 
aaaccttctc tacgccaact ccgcccacgc 21540 
ggacgagccc acccttcttt atgttttgtt 21600 
gccgcaccgc ggcgtcatcg aaaccgtgta 21660 
cacaacataa agaagcaagc aacatcaaca 21720 
gaactgaaag ccattgtcaa agatcttggt 21780 
aagcgctttc caggctttgt ttctccacac 21840 
ggtcgcgaga ctgggggcgt acactggatg 21900 
tgctacctct ttgagccctt tggcttttct 21960 
gagtacgagt cactcctgcg ccgtagcgcc 22020 
ctggaaaagt ccacccaaag cgtacagggg 22080 
tgcatgtttc tccacgcctt tgccaactgg 22140 
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ccccaaactc ccatggatca caaccccacc atgaacctta ttaccggggt acccaactcc 22200 
atgctcaaca gtccccaggt acagcccacc ctgcgtcgca accaggaaca gctctacagc 22260 
ttcctggagc gccactcgcc ctacttccgc agccacagtg cgcagattag gagcgccact 22320 
tctttttgtc acttgaaaaa catgtaaaaa taatgtacta gagacacttt caataaaggc 22380 
aaatgctttt atttgtacac tctcgggtga ttatttaccc ccacccttgc cgtctgcgcc 22440 
gtttaaaaat caaaggggtt ctgccgcgca tcgctatgcg ccactggcag ggacacgttg 22500 
cgatactggt gtttagtgct ccacttaaac tcaggcacaa ccatccgcgg cagctcggtg 22560 
aagttttcac tccacaggct gcgcaccatc accaacgcgt ttagcaggtc gggcgccgat 22620 
atcttgaagt cgcagttggg gcctccgccc tgcgcgcgcg agttgcgata cacagggttg 22680 
cagcactgga acactatcag cgccgggtgg tgcacgctgg ccagcacgct cttgtcggag 22740 
atcagatccg cgtccaggtc ctccgcgttg ctcagggcga acggagtcaa ctttggtagc 22800 
tgccttccca aaaagggcgc gtgcccaggc tttgagttgc actcgcaccg tagtggcatc 22860 
aaaaggtgac cgtgcccggt ctgggcgtta ggatacagcg cctgcataaa agccttgatc 22920 
tgcttaaaag ccacctgagc ctttgcgcct tcagagaaga acatgccgca agacttgccg 22980 
gaaaactgat tggccggaca ggccgcgtcg tgcacgcagc accttgcgtc ggtgttggag 23040 
atctgcacca catttcggcc ccaccggtrc ttcacgatct tggccttgct agactgctcc 23100 
ttcagcgcgc gctgcccgtt ttcgctcgtc acatccattt caatcacgtg ctccttattt 23160 
atcataatgc ttccgtgtag acacttaagc tcgccttcga tctcagcgca gcggtgcagc 23220 
cacaacgcgc agcccgtggg ctcgtgatgc ttgtaggtca cctctgcaaa cgactgcagg 23280 
tacgcctgca ggaatcgccc catcatcgtc acaaaggtct tgttgctggt gaaggtcagc 2334 0 
tgcaacccgc ggtgctcctc gttcagccag gtcttgcata cggccgccag agcttccact 23400 
tggtcaggca gtagtttgaa gttcgccttt agatcgttat ccacgtggta cttgtccatc 23460 
agcgcgcgcg cagcctccat gcccttctcc cacgcagaca cgatcggcac actcagcggg 23520 
ttcatcaccg taatttcact ttccgcttcg ctgggctctt cctcttcctc ttgcgtccgc 23580 
ataccacgcg ccactgggtc gtcttcattc agccgccgca ctgtgcgctt acctcctttg 23640 
ccatgcttga ttagcaccgg tgggttgctg aaacccacca tttgtagcgc cacatcttct 23700 
ctttcttcct cgctgtccac gattacctct ggtgatggcg ggcgctcggg cttgggagaa 23760 
gggcgcttct ttttcttctt gggcgcaatg gccaaatccg ccgccgaggt cgatggccgc 23820 
gggctgggtg tgcgcggcac cagcgcgtct tgtgatgagt cttcctcgtc ctcggactcg 23880 
atacgccgcc tcatccgctt ttttgggggc gcccggggag gcggcggcga cggggacggg 23940 
gacgacacgt cctccatggt tgggggacgt cgcgccgcac cgcgtccgcg ctcgggggtg 24000 
gtttcgcgct gctcctcttc ccgactggcc atttccttct cctataggca gaaaaagatc 24060 
atggagtcag tcgagaagaa ggacagccta accgccccct ctgagttcgc caccaccgcc 24120 
tccaccgatg ccgccaacgc gcctaccacc ttccccgtcg aggcaccccc gcttgaggag 24180 
gaggaagtga ttatcgagca ggacccaggt tttgtaagcg aagacgacga ggaccgctca 24240 
gtaccaacag aggataaaaa gcaagaccag gacaacgcag aggcaaacga ggaacaagtc 24300 
gggcgggggg acgaaaggca tggcgactac ctagatgtgg gagacgacgt gctgttgaag 24360 
catctgcagc gccagtgcgc cattatctgc gacgcgttgc aagagcgcag cgatgtgccc 24420 
ctcgccatag cggatgtcag ccttgcctac gaacgccacc tattctcacc gcgcgtaccc 24480 
cccaaacgcc aagaaaacgg cacatgcgag cccaacccgc gcctcaactt ctaccccgta 24540 
tttgccgtgc cagaggtgct tgccacctat cacatctttt tccaaaactg caagataccc 24600 
ctatcctgcc gtgccaaccg cagccgagcg gacaagcagc tggccttgcg gcagggcgct 24 660 
gtcatacctg atatcgcctc gctcaacgaa gtgccaaaaa tctttgaggg tcttggacgc 24720 
gacgagaagc gcgcggcaaa cgctctgcaa caggaaaaca gcgaaaatga aagtcactct 24780 
ggagtgttgg tggaactcga gggtgacaac gcgcgcctag ccgtactaaa acgcagcatc 24840 
gaggtcaccc actttgccta cccggcactt aacctacccc ccaaggtcat gagcacagtc 24900 
atgagtgagc tgatcgtgcg ccgtgcgcag cccctggaga gggatgcaaa tttgcaagaa 24960 
caaacagagg agggcctacc cgcagttggc gacgagcagc tagcgcgctg gcttcaaacg 25020 
cgcgagcctg ccgacttgga ggagcgacgc aaactaatga tggccgcagt gctcgttacc 25080 
gtggagcttg agtgcatgca gcggttcttt gctgacccgg agatgcagcg caagctagag 25140 
gaaacattgc actacacctt tcgacagggc tacgtacgcc aggcctgcaa gatctccaac 25200 
gtggagctct gcaacctggt ctcctacctt ggaattttgc acgaaaaccg ccttgggcaa 25260 
aacgtgcttc attccacgct caagggcgag gcgcgccgcg actacgtccg cgactgcgtt 25320 
tacttatttc tatgctacac ctggcagacg gccatgggcg tttggcagca gtgcttggag 25380 
gagtgcaacc tcaaggagct gcagaaactg ctaaagcaaa acttgaagga cctatggacg 254 40 
gccttcaacg agcgctccgt ggccgcgcac ctggcggaca tcattttccc cgaacgcctg 25500 
cttaaaaccc tgcaacaggg tctgccagac ttcaccagtc aaagcatgtt gcagaacttt 25560 
aggaacttta tcctagagcg ctcaggaatc ttgcccgcca cctgctgtgc acttcctagc 25620 
gactttgtgc ccattaagta ccgcgaatgc cctccgccgc tttggggcca ctgctacctt 25680 
ctgcagctag ccaactacct tgcctaccac tctgacataa tggaagacgt gagcggtgac 25740 
ggtctactgg agtgtcactg tcgctgcaac ctatgcaccc cgcaccgctc cctggtttgc 25800 
aattcgcagc tgcttaacga aagtcaaatt atcggtacct ttgagctgca gggtccctcg 25860 
cctgacgaaa agtccgcggc tccggggttg aaactcactc cggggctgtg gacgtcggct 25920 
taccttcgca aatttgtacc tgaggactac cacgcccacg agattaggtt ctacgaagac 25980 
caatcccgcc cgccaaatgc ggagcttacc gcctgcgtca ttacccaggg ccacattctt 26040 
ggccaattgc aagccatcaa caaagcccgc caagagtttc tgctacgaaa gggacggggg 26100 



wo 01/04282 



34 



PCT/USOO/18971 



gtttacttgg acccccagtc cggcgaggag 
tatcagcagc agccgcgggc ccttgcttcc 
gccgccgcca cccacggacg aggaggaata 
cgaggaggag gaggacatga tggaagactg 
cgaagaggtg tcagacgaaa caccgtcacc 
gaaatcggca accggttcca gcatggctac 
gcccgttcgc cgacccaacc gtagatggga 
gcagccgccg ccgttagccc aagagcaaca 
gcacaagaac gccatagttg ctrgcttgca 
ccgctttctt ctctaccatc acggcgtggc 
tcatctctac agcccatact gcaccggcgg 
cacagaagca aaggcgaccg gatagcaaga 
cggcagcagc aggaggagga gcgctgcgtc 
agcttagaaa caggattttt cccactctgt 
aacaagagct gaaaataaaa aacaggtctc 
acaaaagcga agatcagctt cggcgcacgc 
actgcgcgct gactcttaag gactagtttc 
tacgtcatct ccagcggcca cacccggcgc 
aggaaattcc cacgccctac atgtggagtt 
ctgcccaaga ctactcaacc cgaataaact 
gggtcaacgg aatccgcgcc caccgaaacc 
ccacacctcg taataacctt aatccccgta 
gtcccgctcc caccactgtg gtacttccca 
actcaggggc gcagcttgcg ggcggctttc 
taactcacct gacaatcaga gggcgaggta 
cgcttggtct ccgtccggac gggacatttc 
cgcctcgtca ggcaatccta actctgcaga 
ttggaactct gcaatttatt gaggagtttg 
gacctcccgg ccactatccg gatcaattta 
cggacggcta cgactgaatg ttaagtggag 
tccactgtcg ccgccacaag tgctttgccc 
tgcccgagga tcatatcgag ggcccggcgc 
ttgcccgtag cctgattcgg gagtttaccc 
gaccctgtgt tctcactgtg atttgcaact 
gttgccatct ctgtgctgag tataataaat 
cgccatcctg taaacgccac cgtcttcacc 
ggtactttta acatctctcc ctctgtgatt 
ctacgagaga acctctccga gctcagctac 
tgccgggaac gtacgagtgc gtcaccggcc 
ccagactttt tccggacaga cctcaataac 
aaaaccctta gggtattagg ccaaaggcgc 
caactctacg ggctattcta attcaggttt 
atctgacttt ggccagcacc tgtcccgcgg 
cctaacagag atgaccaaca caaccaacgc 
aaatacaccc caagtttctg cctttgtcaa 
ctccatagcg cttatgtttg tatgccttat 
caaacgcgcc cgaccaccca tctatagtcc 
aatccataga ttggacggac tgaaacacat 
gatctagaaa tggacggaat tattacagag 
gccgagcaac agcgcatgaa tcaagagctc 
aggggtatct tttgtctggt aaagcaggcc 
caccgcctta gctacaagtt gccaaccaag 
aagcccatta ccataactca gcactcggta 
caaggacctg aggatctctg cacccttatt 
ccctttaact aataaaaaaa aataataaag 
tctgtccagt ttattcagca gcacctcctt 
cctcctggct gcaaactttc tccacaatct 
tccatccgca cccactatct tcatgttgtt 
taccttcaac cccgtgtatc catatgacac 
tactcctccc tttgtatccc ccaatgggtt 
gcgcctatcc gaacctctag ttacctccaa 
cctctctctg gacgaggccg gcaaccttac 
tctcaaaaaa accaagtcaa acataaacct 
agaagcccta actgtggctg ccgccgcacc 
gcaatcacag gccccgctaa ccgtgcacga 
cctcacagtg tcagaaggaa agctagccct 



ctcaacccaa tccccccgcc gccgcagccc 26160 
caggatggca cccaaaaaga agctgcagct 26220 
ctgggacagt caggcagagg aggttttgga 26280 
ggagagccta gacgaggaag cttccgaggt 26340 
ctcggtcgca ttcccctcgc cggcgcccca 26400 
aacctccgct cctcaggcgc cgccggcact 26460 
caccactgga accagggccg gtaagtccaa 26520 
acagcgccaa ggctaccgct catggcgcgg 26580 
agactgtggg ggcaacatct ccttcgcccg 26640 
cttcccccgt aacatcctgc attactaccg 26700 
cagcggcagc ggcagcaaca gcagcggcca 26760 
ctctgacaaa gcccaagaaa tccacagcgg 26820 
tggcgcccaa cgaacccgta tcgacccgcg 26880 
atgctatatt tcaacagagc aggggccaag 26940 
tgcgatccct cacccgcagc tgcctgtatc 27000 
tggaagacgc ggaggctctc ttcagtaaat 27060 
gcgccctttc tcaaatttaa gcgcgaaaac 27120 
cagcacctgt cgtcagcgcc attatgagca 27180 
accagccaca aatgggactt gcggctggag 27240 
acatgagcgc gggaccccac atgatatccc 27300 
gaattctctt ggaacaggcg gctattacca 27360 
gttggcccgc tgccctggtg taccaggaaa 27420 
gagacgccca ggccgaagtt cagatgacta 27480 
gtcacagggt gcggtcgccc gggcagggta 2754 0 
ttcagctcaa cgacgagtcg gtgagctcct 27 600 
agatcggcgg cgccggccgt ccttcattca 27 660 
cctcgtcctc tgagccgcgc tctggaggca 27720 
tgccatcggt ctactttaac cccttctcgg 27780 
ttcctaactt tgacgcggta aaggactcgg 2784 0 
aggcagagca actgcgcctg aaacacctgg 27 900 
gcgactccgg tgagttttgc tactttgaat 27 960 
acggcgtccg gcttaccgcc cagggagagc 28020 
agcgccccct gctagttgag cgggacaggg 28080 
gtcctaacct tggattacat caagatcttt 28140 
acagaaatta aaatatactg gggctcctat 28200 
cgcccaagca aaccaaggcg aaccttacct 28260 
tacaacagtt tcaacccaga cggagtgagt 28320 
tccatcagaa aaaacaccac cctccttacc 28380 
gctgcaccac acctaccgcc tgaccgtaaa 28440 
tctgtttacc agaacaggag gtgagcttag 28500 
agctactgtg gggtttatga acaattcaag 28560 
ctctagaagt caggcttcct ggatgtcagc 28620 
atttgttcca gtccaactac agcgacccac 28680 
ggccgccgct accggactta catctaccac 28740 
taactgggat aacttgggca tgtggtggtt 28800 
tattatgtgg ctcatctgct gcctaaagcg 28860 
catcattgtg ctacacccaa acaatgatgg 28920 
gttcttttct cttacagtat gattaaatga 28980 
cagcgcctgc tagaaagacg cagggcagcg 29040 
caagacatgg ttaacttgca ccagtgcaaa 29100 
aaagtcacct acgacagtaa taccaccgga 29160 
cgtcagaaat tggtggtcat ggtgggagaa 29220 
gaaaccgaag gctgcattca ctcaccttgt 29280 
aagaccctgt gcggtctcaa agatcttatt 29340 
catcacttac ttaaaatcag ttagcaaatt 29400 
gccctcctcc cagctctggt attgcagctt 29460 
aaatggaatg tcagtttcct cctgttcctg 29520 
gcagatgaag cgcgcaagac cgtctgaaga 29580 
ggaaaccggt cctccaactg tgccttttct 29640 
tcaagagagt ccccctgggg tactctcttt 29700 
tggcatgctt gcgctcaaaa tgggcaacgg 29760 
ctcccaaaat gtaaccactg tgagcccacc 29820 
ggaaatatct gcacccctca cagttacctc 29880 
tctaatggtc gcgggcaaca cactcaccat 2994 0 
ctccaaactt agcattgcca cccaaggacc 30000 
gcaaacatca ggccccctca ccaccaccga 30060 
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nagcagtacc cttactatca ctgcctcacc ccctctaact actgccactg gtagcttggg 30120 
cattgacttg aaagagccca tttatacaca aaatggaaaa ctaggactaa agtacggggc 30180 
tcctttgcat gtaacagacg acctaaacac tttgaccgta gcaactggtc caggtgtgac 30240 
tattaataat acttccttgc aaactaaagt tactggagcc ttgggttttg attcacaagg 30300 
caatatgcaa cttaatgtag caggaggact aaggattgat tctcaaaaca gacgccttat 30360 
acttgatgtt agttatccgt ttgatgctca aaaccaacta aatctaagac taggacaggg 30420 
ccctcttttt ataaactcag cccacaactt ggatattaac tacaacaaag gcctttactt 30480 
gtttacagct tcaaacaatt ccaaaaagct tgaggttaac ctaagcactg ccaaggggtt 30540 
gatgtttgac gctacagcca tagccattaa tgcaggagat gggcttgaat ttggttcacc 30600 
taatgcacca aacacaaatc ccctcaaaac aaaaattggc catggcctag aatttgattc 30660 
aaacaaggct atggttccta aactaggaac tggccttagt tttgacagca caggtgccat 30720 
tacagtagga aacaaaaata atgataagct aactttgtgg accacaccag ctccatctcc 30780 
taactgtaga ctaaatgcag agaaagatgc taaactcact ttggtcttaa caaaatgtgg 3084 0 
cagtcaaata cttgctacag tttcagtttt ggctgttaaa ggcagtttgg ctccaatatc 30900 
tggaacagtt caaagtgctc atcttattat aagatttgac gaaaatggag tgctactaaa 30960 
caattccttc ctggacccag aatattggaa ctttagaaat ggagatctta ctgaaggcac 31020 
agcctataca aacgctgttg gatttatgcc taacctatca gcttatccaa aatctcacgg 31080 
taaaactgcc aaaagtaaca ttgtcagtca agtttactta aacggagaca aaactaaacc 31140 
tgtaacacta accattacac taaacggtac acaggaaaca ggagacacaa ctccaagtgc 31200 
atactctatg tcattttcat gggactggtc tggccacaac tacattaatg aaatatttgc 31260 
cacatcctct tacacttttt catacattgc ccaagaataa agaatcgttt gtgttatgtt 31320 
tcaacgtgtt tatttttcaa ttgcagaaaa tttcaagtca tttttcattc agtagtatag 31380 
ccccaccacc acatagctta tacagatcac cgtaccttaa tcaaactcac agaaccctag 31440 
tattcaacct gccacctccc tcccaacaca cagagtacac agtcctttct ccccggctgg 31500 
ccttaaaaag catcatatca tgggtaacag acatattctt aggtgttata ttccacacgg 31560 
tttcctgtcg agccaaacgc tcatcagtga tattaataaa ctccccgggc agctcactta 31620 
agttcatgtc gctgtccagc tgctgagcca caggctgctg tccaacttgc ggttgcttaa 31680 
cgggcggcga aggagaagtc cacgcctaca tgggggtaga gtcataatcg tgcatcagga 31740 
tagggcggtg gtgctgcagc agcgcgcgaa taaactgctg ccgccgccgc tccgtcctgc 31800 
aggaatacaa catggcagtg gtctcctcag cgatgattcg caccgcccgc agcataaggc 31860 
gccttgtcct ccgggcacag cagcgcaccc tgatctcact taaatcagca cagtaactgc 31920 
agcacagcac cacaatattg ttcaaaatcc cacagtgcaa ggcgctgtat ccaaagctca 31980 
tggcggggac cacagaaccc acgtggccat cataccacaa gcgcaggtag attaagtggc 32040 
gacccctcat aaacacgctg gacataaaca ttacctcttt tggcatgttg taattcacca 32100 
cctcccggta ccatataaac ctctgattaa acatggcgcc atccaccacc atcctaaacc 32160 
agctggccaa aacctgcccg ccggctatac actgcaggga accgggactg gaacaatgac 32220 
agtggagagc ccaggactcg taaccatgga tcatcatgct cgtcatgata tcaatgttgg 32280 
cacaacacag gcacacgtgc atacacttcc tcaggattac aagctcctcc cgcgttagaa 32340 
ccatatccca gggaacaacc cattcctgaa tcagcgtaaa tcccacactg cagggaagac 32400 
ctcgcacgta actcacgttg tgcattgtca aagtgttaca ttcgggcagc agcggatgat 32460 
cctccagtat ggtagcgcgg gtttctgtct caaaaggagg tagacgatcc ctactgtacg 32520 
gagtgcgccg agacaaccga gatcgtgttg gtcgtagtgt catgccaaat ggaacgccgg 32580 
acgtagtcat atttcctgaa gcaaaaccag gtgcgggcgt gacaaacaga tctgcgtctc 3264 0 
cggtctcgcc gcttagatcg ctctgtgtag tagttgtagt atatccactc tctcaaagca 32700 
tccaggcgcc ccctggcttc gggttctatg taaactcctt catgcgccgc tgccctgata 32760 
acatccacca ccgcagaata agccacaccc agccaaccta cacattcgtt ctgcgagtca 32820 
cacacgggag gagcgggaag agctggaaga accatgtttt tttttttatt ccaaaagatt 32880 
atccaaaacc tcaaaatgaa gatctattaa gtgaacgcgc tcccctccgg tggcgtggtc 32940 
aaactctaca gccaaagaac agataatggc atttgtaaga tgttgcacaa tggcttccaa 33000 
aaggcaaacg gccctcacgt ccaagtggac gtaaaggcta aacccttcag ggtgaatctc 33060 
ctctataaac attccagcac cttcaaccat gcccaaataa ttctcatctc gccaccttct 33120 
caatatatct ctaagcaaat cccgaatatt aagtccggcc attgtaaaaa tctgctccag 33180 
agcgccctcc accttcagcc tcaagcagcg aatcatgatt gcaaaaattc aggttcctca 3324 0 
cagacctgta taagattcaa aagcggaaca ttaacaaaaa taccgcgatc ccgtaggtcc 33300 
cttcgcaggg ccagctgaac ataatcgtgc aggtctgcac ggaccagcgc ggccacttcc 33360 
ccgccaggaa ccttgacaaa agaacccaca ctgattatga cacgcatact cggagctatg 33420 
ctaaccagcg tagccccgat gtaagctttg ttgcatgggc ggcgatataa aatgcaaggt 33480 
gctgctcaaa aaatcaggca aagcctcgcg caaaaaagaa agcacatcgt agtcatgctc 33540 
atgcagataa aggcaggtaa gctccggaac caccacagaa aaagacacca tttttctctc 33600 
aaacatgtct gcgggtttct gcataaacac aaaataaaat aacaaaaaaa catttaaaca 33660 
ttagaagcct gtcttacaac aggaaaaaca acccttataa gcataagacg gactacggcc 33720 
atgccggcgt gaccgtaaaa aaactggtca ccgtgattaa aaagcaccac cgacagctcc 33780 
tcggtcatgt ccggagtcat aatgtaagac tcggtaaaca catcaggttg attcatcggt 33840 
cagtgctaaa aagcgaccga aatagcccgg gggaatacat acccgcaggc gtagagacaa 33900 
cattacagcc cccataggag gtataacaaa attaatagga gagaaaaaca cataaacacc 33960 
tgaaaaaccc tcctgcctag gcaaaatagc accctcccgc tccagaacaa catacagcgc 34020 
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ttcacagcgg cagcctaaca gtcagcctta ccagtaaaaa agaaaaccta ttaaaaaaac 34080 

accactcgac acggcaccag ctcaatcagt cacagtgtaa aaaagggcca agtgcagagc 34140 

gagtatatat aggactaaaa aatgacgtaa cggttaaagt ccacaaaaaa cacccagaaa 34200 

accgcacgcg aacctacgcc cagaaacgaa agccaaaaaa cccacaactt cctcaaatcg 34260 

tcacttccgt tttcccacgt tacgtaactt cccattttaa gaaaactaca attcccaaca 34320 

catacaagtt actccgccct aaaacctacg tcacccgccc cgttcccacg ccccgcgcca 34380 

cgtcacaaac tccaccccct cattatcata ttggcttcaa tccaaaataa ggtatattat 34440 

tgatgatg 34448 



<210> 5 
<211> 94 
<212> PRT 

<213> Adenovirus subgroup C 



<400> 5 

Met Val Asp Thr Val Asn Ser Tyr 
1 5 

Ala Leu Asn Leu Pro Gin Val Ser 
20 

Leu Gly Met Tro Trp Phe Ser lie 

35 40 

He Met Trp Leu Ser Cys Cys Leu 
50 55 

He Tyr Lys Pro He He Val Leu 
65 70 

Arg Leu Asp Gly Leu Asn Thr Cys 
85 



Asn Thr Ala Thr Gly Leu Thr Ser 
10 15 

Thr Phe Val Asn Asn Trp Ala Asn 
25 30 

Ala Leu Met Phe Val Cys Leu He 
45 

Lys Arg Lys Arg Ala Arg Pro Pro 
60 

Asn Pro Asn Asn Asp Gly He His 
75 80 

Ser Phe Ser Phe Ala Val 
90 



<210> 6 
<211> 101 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 6 

Met Thr Gly Ser Thr He Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
1 5 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 

Val Asn Asp Trp Ala Ser Leu Asp Met Trp Trp Phe Ser He Ala Leu 
35 40 45 

Met Phe Val Cys Leu He He Met Trp Leu He Cys Cys Leu Lys Arg 
50 55 60 

Arg Arg Ala Arg Pro Pro He Tyr Arg Pro He He Val Leu Asn Pro 
65 70 75 80 

His Asn Glu Lys He His Arg Leu Asp Gly Leu Lys Pro Cys Ser Leu 
85 90 95 

Leu Leu Gin Tyr Asp 
100 



<210> 7 
<211> 93 
<212> PRT 

<213> Adenovirus subgroup C 
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<400> 7 

Met Thr Asn Thr Thr Asn Ala Ala 
1 5 

Thr Asn Thr Pro Gin Val Ser Ala 
20 

Gly Met Trp Trp Phe Ser lie Ala 
35 40 

Met Trp Leu He Cys Cys Leu Lys 
50 55 

Tyr Ser Pro He He Val Leu His 
65 7C 

Leu Asp Gly Leu Lys His Met Phe 
85 



Ala Ala Thr Gly Leu Thr Ser Thr 
10 15 

Phe Val Asn Asn Trp Asp Asn Leu 

25 30 

Leu Met Phe Val Cys Leu He He 
45 

Arg Lys Arg Ala Arg Pro Pro He 

60 

Pre Asn Asn Asp Gly He His Arg 
75 80 

Phe Ser Leu Thr Val 
90 



<210> 8 
<211> 95 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 8 

Met Val Asp Thr Val Asn Ser Tyr Asn Thr Ala Thr Gly Leu Lys Ser 
1 5 10 15 

Ala Leu Asn Leu Pro Gin Val His Ala Phe Val Asn Asp Trp Ala Ser 
20 25 30 

Leu Gly Met Trp Trp Phe Ser He Ala Leu Met Phe Val Cys Leu He 
35 40 45 

He Met Trp Leu He Cys Cys Leu Lys Arg Arg Arg Ala Arg Pro Pro 
50 55 60 

He Tyr Arg Pro He He Val Leu Asn Pro His Asn Glu Lys He His 
65 70 75 80 

Arg Leu Asp Gly Leu Lys Pro Cys Ser Leu Leu Leu Gin Tyr Asp 
85 90 95 



<210> 9 
<211> 78 
<212> PRT 

<213> Adenovirus subgroup C 



<400> 9 

Met Thr Gly Ser Thr He Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 

15 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 



Val Asn Asp Trp Ala Ser Leu Asp Met Trp Trp Phe Ser He Ala Leu 
35 40 45 

Met Phe Val Cys Leu He He Met Trp Leu He Cys Cys Leu Lys Arg 
50 55 60 

Arg Arg Ala Arg Pro Pro He Tyr Arg Pro He He Val Leu 
65 70 75 
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<210> XO 
<211> 87 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 10 

Met Thr Gly Ser Thr lie Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
15 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 

20 25 30 

Val Asn Asp Trp Ala Ser Leu Asp Met Trp Trp Phe Ser He Ala Leu 
35 40 45 

Met Phe Val Cys Leu He He Met Trp Leu He Cys Cys Leu Lys Arg 

50 55 60 

Arg Arg Ala Arg Pro Pro He Tyr Arg Pro He Gly Leu Lys Pro Cys 
65 70 75 80 

Ser Leu Leu Leu Gin Tyr Asp 
85 



<210> 11 
<211> 77 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 11 

Met Thr Gly Ser Thr He Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
1 5 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 

Val Asn Asp Trp Ala Ser Leu Asp Met Trp Tip Phe Ser He Ala Leu 
35 40 45 

Met Phe Val Cys Leu He He Met Trp Leu He Cys Cys Leu Lys Arg 
50 55 60 

Arg Arg Ala Arg Pro Pro Ser Leu Leu Leu Gin Tyr Asp 
65 70 75 



<210> 12 
<211> 84 
<212> PRT 

<213> Adenovirus subgroup C 

<400> 12 

Met Thr Gly Ser Thr He Ala Pro 
1 5 

Ala Thr Gly Leu Thr Ser Ala Leu 
20 

Phe Val Cys Leu He He Met Trp 

35 40 



Thr Thr Asp Tyr Arg Asn Thr Thr 
10 15 

Asn Leu Pro Gin He Ala Leu Met 
25 30 

Leu He Cys Cys Leu Lys Arg Arg 
45 



Arg Ala Arg Pro Pro He Tyr Arg Pro He He Val Leu Asn Pro His 
50 55 60 
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Asn Glu Lys lie His Arg Leu Asp Gly Leu Lys Pro Cys Ser Leu Leu 
65 70 75 80 



Leu Gin Tyr Asp 



<210> 13 
<211> 35724 
<212> DNA 

<213> Adenovirus subgroup C 
<400> 13 

catcatcaat aatatacctt attttggatt 
ttgtgacgtg gcgcggggcg tgggaacggg 
gatgttgcaa gtgtggcgga acacatgtaa 
gtgtgcgccg gtgtacacag gaagtgacaa 
taaatttggg cgtaaccgag taagatttgg 
agtgaaatct gaataatttt gtgttactca 
gactttgacc gtttacgtgg agactcgccc 
cgggtcaaag ttggcgtttt attattatag 
tgagttcctc aagaggccac tcttgagtgc 
tccgacaccg ggactgaaaa tgagacatga 
ccattttgaa ccacctaccc ttcacgaact 
tcccaacgag gaggcggttt cgcagatttt 
agggattgac ttactcactt ttccgccggc 
ccggcagccc gagcagccgg agcagagagc 
tccacccagt gacgacgagg atgaagaggg 
ccccgggcac ggttgcaggt cttgtcatta 
tatgtgttcg ctttgctata tgaggacctg 
atgggcagtg ggtgatagag tggtgggttt 
gttttgtggt ttaaagaatt ttgtattgtg 
gagcctgagc ccgagccaga accggagcct 
cctgctatcc tgagacgccc gacatcacct 
agctgtgact ccggtccttc taacacacct 
cccattaaac cagttgccgt gagagttggt 
gacttgctta acgagcctgg gcaacctttg 
ggtgtaaacc tgtgattgcg tgtgtggtta 
agtttaataa agggtgagat aatgtttaac 
aaagggtata taatgcgccg tgggctaatc 
gagtgtttgg aagatttttc tgctgtgcgt 
tcttggtttt ggaggtttct gtggggctca 
gaggattaca agtgggaatt tgaagagctt 
ttgaatctgg gtcaccaggc gcttttccaa 
acaccggggc gcgctgcggc tgctgttgct 
gaagaaaccc atctgagcgg ggggtacctg 
gcggttgtga gacacaagaa tcgcctgcta 
ccgacggagg agcagcagca gcagcaggag 
ccatggaacc cgagagccgg cctggaccct 
tgtatccaga actgagacgc attttgacaa 
taaagaggga gcggggggct tgtgaggcta 
taatgaccag acaccgtcct gagtgtatta 
atgagcttga tctgctggcg cagaagtatt 
agccagggga tgattttgag gaggctatta 
attgcaagta caagatcagc aaacttgtaa 
acggggccga ggtggagata gatacggagg 
atatgtggcc gggggtgctt ggcatggacg 
gccccaattt tagcggtacg gttttcctgg 
gcttctatgg gtttaacaat acctgtgtgg 
gtgcctttta ctgctgctgg aagggggtgg 
agaaatgcct ctttgaaagg tgtaccttgg 
gccacaatgt ggcctccgac tgtggttgct 
agcataacat ggtatgtggc aactgcgagg 
acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata ctgacccgct 



gaagccaata tgataatgag ggggtggagt 60 
gcgggtgacg tagtagtgtg gcggaagtgt 120 
gcgacggatg tggcaaaagt gacgtttttg 180 
ttttcgcgcg gttttaggcg gatgttgtag 240 
ccattttcgc gggaaaactg aataagagga 300 
tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 420 
tcagctgacg tgtagtgtat ttatacccgg 480 
cagcgagtag agttttctcc tccgagccgc 540 
ggtactggct gataatcttc cacctcctag 600 
gtatgattta gacgtgacgg cccccgaaga 660 
tcccgactct gtaatgttgg cggtgcagga 720 
gcccggttct ccggagccgc ctcacctttc 780 
cttgggtccg gtttgccacg aggctggctt 840 
tgaggagttt gtgttagatt atgtggagca 900 
tcaccggagg aatacggggg acccagatat 960 
tggcatgttt gtctacagta agtgaaaatt 1020 
ggtgtggtaa tttttttttt aatttttaca 1080 
atttttttaa aaggtcctgt gtctgaacct 1140 
gcaagaccta cccgccgtcc taaaatggcg 1200 
gtgtctagag aatgcaatag tagtacggat 1260 
cctgagatac acccggtggt cccgctgtgc 1320 
gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgagct gtaaacgccc caggccataa 14 40 
acgcctttgt ttgctgaatg agttgatgta 1500 
ttgcatggcg tgttaaatgg ggcggggctt 1560 
ttggttacat ctgacctcat ggaggcttgg 1620 
aacttgctgg aacagagctc taacagtacc 1680 
tcccaggcaa agttagtctg cagaattaag 174 0 
ttgaaatcct gtggtgagct gtttgattct 1800 
gagaaggtca tcaagacttt ggatttttcc 1860 
tttttgagtt ttataaagga taaatggagc 1920 
ctggattttc tggccatgca tctgtggaga 1980 
ctgttgtctt ccgtccgccc ggcgataata 2040 
gaagccaggc ggcggcggca ggagcagagc 2100 
cgggaatgaa tgttgtacag gtggctgaac 2160 
ttacagagga tgggcagggg ctaaaggggg 2220 
cagaggaggc taggaatcta gcttttagct 2280 
cttttcaaca gatcaaggat aattgcgcta 2340 
ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 24 60 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
gggtggttat tatgaatgta aggtttactg 264 0 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 27 60 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 2940 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgca tttgggtaac aggagggggg 3120 
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rgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 
ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
tctggatgtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 
tgtttacaaa gcggttaagc tgggatgggt 
actgtatttt taggttggct atgttcccag 
gaaccaccag cacagtgtat ccggtgcact 
atgcgtggaa gaacttggag acgcccttgt 
taargatggc aatgggccca cgggcggcgg 
cgtcatagtt gtgttccagg atgagatcgt 
gggtgccaga ctgcggtata atggttccat 
tttgcatttc ccacgctttg agttcagatg 
agaaaacggt ttccggggta ggggagatca 
gcgacttacc gcagccggtg ggcccgtaaa 
taagagagct gcagctgccg tcatccctga 
tgactcgcat gttttccctg accaaatccg 
gttcttgcaa ggaagcaaag tttttcaacg 
tgagcgtttg accaagcagt tccaggcggt 
ctcgatccag catatctcct cgtttcgcgg 
tcggtgctcg tccagacggg ccagggtcat 
cgtagtctgg gtcacggtga aggggtgcgc 
gaggctggtc ctgctggtgc tgaagcgctg 
gcatttgacc atggtgtcat agtccagccc 
gcccttggag gaggcgccgc acgaggggca 
cgcgagaaat accgattccg gggagtaggc 
gcattccacg agccaggtga gctctggccg 
ctttttgatg cgtttcttac ctctggtttc 
aaggctgtcc gtgtccccgt atacagactt 
gtcctcctcg tatagaaact cggaccactc 
gaaggaggct aagtgggagg ggtagcggtc 
ggtgtgaaga cacatgtcgc cctcttcggc 
ggccacgtga ccgggtgttc ctgaaggggg 
ctcactctct tccgcatcgc tgtctgcgag 
aaaagcgggc atgacttctg cgctaagatt 
attcacctgg cccgcggtga tgcctttgag 
aatctttttg ttgtcaagct tggtggcaaa 
ggcgatggag cgcagggttt ggtttttgtc 
tagctgcacg tattcgcgcg caacgcaccg 
gggcaccagg tgcacgcgcc aaccgcggtt 
tacctctccg cgtaggcgct cgttggtcca 
tggcggtagg gggtctagct gcgtctcgtc 
gggcagcagg cgcgcgtcga agtagtctat 
ccatgcgcgg gcggcaagcg cgcgctcgta 
gtgggtgagc gcggaggcgt acatgccgca 
tattccaaga tatgtagggt agcatcttcc 
tagttcgtgc gagggagcga ggaggtcggg 
tcggaagact atctgcctga agatggcatg 
gacgttgaag ctggcgtctg tgagacctac 
gcgcagcttg ttgaccagct cggcggtgac 
ttccttgatg atgtcatact tatcctgtcc 
aaactcttcg cggtctttcc agtactcttg 



acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 3240 
cctgcgagtg cggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 3540 
ccggggtgcg tcagaatgtg atgggctcca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
ttttggcaca attggattct ttgacccggg 3840 
atctgcgcca gcaggtttct gccctgaagg 3900 
taaataaaaa accagactct gtttggattt 3960 
taggggtttt gcgcgcgcgg taggcccggg 4020 
gtattttttc caggacgtgg taaaggtgac 4080 
cgtctctggg gtggaggtag caccactgca 4140 
tgatccagtc gtagcaggag cgctgggcgt 4200 
tgattgccag gggcaggccc ttggtgtaag 4260 
gcatacgtgg ggatatgaga tgcatcttgg 4320 
ccatatccct ccggggattc atgttgtgca 4380 
tgggaaattt gtcatgtagc ttagaaggaa 4440 
gacctccaag attttccatg cattcgtcca 4500 
cctgggcgaa gatatttctg ggatcactaa 4560 
cataggccat ttttacaaag cgcgggcgga 4 620 
ccggcccagg ggcgtagtta ccctcacaga 4 680 
gggggatcat gtctacctgc ggggcgatga 474 0 
gctgggaaga aagcaggttc ctgagcagct 4800 
tcacacctat taccgggtgc aactggtagt 4860 
gcaggggggc cacttcgtta agcatgtccc 4920 
ccagaaggcg ctcgccgccc agcgatagca 4980 
gtttgagacc gtccgccgta ggcatgcttt 5040 
cccacagctc ggtcacctgc tctacggcat 5100 
gttggggcgg ctttcgctgt acggcagtag 5160 
gtctttccac gggcgcaggg tcctcgtcag 5220 
tccgggctgc gcgctggcca gggtgcgctt 5280 
ccggtcttcg ccctgcgcgt cggccaggta 534 0 
ctccgcggcg tggcccttgg cgcgcagctt 5400 
gtgcagactt ttgagggcgt agagcttggg 54 60 
atccgcgccg caggccccgc agacggtctc 5520 
ttcggggtca aaaaccaggt ttcccccatg 5580 
catgagccgg tgtccacgct cggtgacgaa 564 0 
gagaggcctg tcctcgagcg gtgttccgcg 5700 
tgagacaaag gctcgcgtcc aggccagcac 5760 
gttgtccact agggggtcca ctcgctccag ,5820 
atcaaggaag gtgattggtt tgtaggtgta 5880 
gctataaaag ggggtggggg cgcgttcgtc 5940 
ggccagctgt tggggtgagt actccctctg 6000 
gtcagtttcc aaaaacgagg aggatttgat 6060 
ggtggccgca tccatctggt cagaaaagac 6120 
cgacccgtag agggcgttgg acagcaactt 6180 
gcgatcggcg cgctccttgg ccgcgatgtt 6240 
ccattcggga aagacggtgg tgcgctcgtc 6300 
gtgcagggtg acaaggtcaa cgctggtggc 6360 
gcagaggcgg ccgcccttgc gcgagcagaa 6420 
cggggggtct gcgtccacgg taaagacccc 6480 
cttgcatcct tgcaagtcta gcgcctgctg 6540 
tgggttgagt gggggacccc atggcatggg 6600 
aatgtcgtaa acgtagaggg gctctctgag 6660 
accgcggatg ctggcgcgca cgtaatcgta 6720 
accgaggttg ctacgggcgg gctgctctgc 6780 
tgagttggat gatatggttg gacgctggaa 6840 
cgcgtcacgc acgaaggagg cgtaggagtc 6900 
ctgcacgtct agggcgcagt agtccagggt 6960 
cttttttttc cacagctcgc ggttgaggac 7020 
gatcggaaac ccgtcggcct ccgaacggta 7080 
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agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 7140 

tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 7200 

gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 7260 

gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 7320 

gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 7380 

ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 7440 

gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc ttgatggaag gcaatttttt 7500 

aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 7560 

tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 7620 

caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 7680 

gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 7740 

cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 7800 

ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 7860 

ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 7920 

gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 7980 

tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 8040 

gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 8100 

tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 8160 

tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 8220 

tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 8280 

cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 834 0 

tagatccagg tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 8400 

gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 8460 

gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 8520 

tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 8580 

tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 8640 

cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 8700 

tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 8760 

tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 8820 

ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 8880 

gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 8940 

gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 9000 

cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 9060 

ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 9120 

gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 9180 

tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 9240 

acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 9300 

tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 9360 

acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 9420 

ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 9480 

gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 9540 

actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 9600 

aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 9660 

cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 9720 

ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 9780 

aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 9840 

tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 9900 

tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 9960 

gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 10020 

aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 10080 

tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 10140 

tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 10200 

acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 10260 

tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 10320 

aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 10380 

gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 104 4 0 

gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagcg tgcaaaagga 10500 

gagcctgtaa gcgggcactc ttccgtggtc tggtggataa attcgcaagg gtatcatggc 10560 

ggacgaccgg ggttcgagcc ccgtatccgg ccgtccgccg tgatccatgc ggttaccgcc 10620 

cgcgtgtcga acccaggtgt gcgacgtcag acaacggggg agtgctcctt ttggcttcct 10680 

tccaggcgcg gcggctgctg cgctagcttt tttggccact ggccgcgcgc agcgtaagcg 10740 

gttaggctgg aaagcgaaag cattaagtgg ctcgctccct gtagccggag ggttattttc 10800 

caagggttga gtcgcgggac ccccggttcg agtctcggac cggccggact gcggcgaacg 10860 

ggggtttgcc tccccgtcat gcaagacccc gcttgcaaat tcctccggaa acagggacga 10920 

gccccttttt tgcttttccc agatgcatcc ggtgctgcgg cagatgcgcc cccctcctca 10980 

gcagcggcaa gagcaagagc agcggcagac atgcagggca ccctcccctc ctcctaccgc 11040 
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gtcaggaggg gcgacatccg cggttgacgc ggcagcagat ggtgattacg aacccccgcg 11100 
gcgccgggcc cggcactacc tggacttgga ggagggcgag ggcctggcgc ggctaggagc 11160 
gccctctcct gagcggtacc caagggtgca gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gccgcggcag aacctgtttc gcgaccgcga gggagaggag cccgaggaga tgcgggatcg 11280 
aaagttccac gcagggcgcg agctgcggca tggcctgaat cgcgagcggt tgctgcgcga 11340 
ggaggacttt gagcccgacg cgcgaaccgg gattagtccc gcgcgcgcac acgtggcggc 11400 
cgccgacctg gtaaccgcat acgagcagac ggtgaaccag gagattaact ttcaaaaaag 114 60 
ctttaacaac cacgtgcgta cgcttgtggc gcgcgaggag gtggctatag gactgatgca 11520 
tctgtgggac tttgtaagcg cgctggagca aaacccaaat agcaagccgc tcatggcgca 11580 
gctgttcctt atagtgcagc acagcaggga caacgaggca ttcagggatg cgctgctaaa 11640 
catagtagag cccgagggcc gctggctgct cgatttgata aacatcctgc agagcatagt 11700 
ggtgcaggag cgcagcttga gcctggctga caaggtggcc gccatcaact attccatgct 11760 
tagcctgggc aagttttacg cccgcaagat ataccatacc ccttacgttc ccatagacaa 11820 
ggaggtaaag atcgaggggt tctacatgcg catggcgctg aaggtgctta ccttgagcga 11880 
cgacctgggc gtttatcgca acgagcgcat ccacaaggcc gtgagcgtga gccggcggcg 11940 
cgagctcagc gaccgcgagc tgatgcacag cctgcaaagg gccctggctg gcacgggcag 12000 
cggcgataga gaggccgagt cctactttga cgcgggcgct gacctgcgct gggccccaag 12060 
ccgacgcgcc ctggaggcag ctggggccgg acctgggctg gcggtggcac ccgcgcgcgc 12120 
tggcaacgtc ggcggcgtgg aggaatatga cgaggacgat gagtacgagc cagaggacgg 12180 
cgagtactaa gcggtgatgt ttctgatcag atgatgcaag acgcaacgga cccggcggtg 12240 
cgggcggcgc tgcagagcca gccgtccggc cttaactcca cggacgactg gcgccaggtc 12300 
atggaccgca tcatgtcgct gactgcgcgc aatcctgacg cgttccggca gcagccgcag 12360 
gccaaccggc tctccgcaat tctggaagcg gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gagaaggtgc tggcgatcgt aaacgcgctg gccgaaaaca gggccatccg gcccgacgag 12480 
gccggcctgg tctacgacgc gctgcttcag cgcgtggctc gttacaacag cggcaacgtg 1254 0 
cagaccaacc tggaccggct ggtgggggat gtgcgcgagg ccgtggcgca gcgtgagcgc 12600 
gcgcagcagc agggcaacct gggctccatg gttgcactaa acgccttcct gagtacacag 12660 
cccgccaacg tgccgcgggg acaggaggac tacaccaact ttgtgagcgc actgcggcta 12720 
atggtgactg agacaccgca aagtgaggtg taccagtctg ggccagacta ttttttccag 12780 
accagtagac aaggcctgca gaccgtaaac ctgagccagg ctttcaaaaa cttgcagggg 12840 
ctgtgggggg tgcgggctcc cacaggcgac cgcgcgaccg tgtctagctt gctgacgccc 12900 
aactcgcgcc tgttgctgct gctaatagcg cccttcacgg acagtggcag cgtgtcccgg 12960 
gacacatacc taggtcactt gctgacactg taccgcgagg ccataggtca ggcgcatgtg 13020 
gacgagcata ctttccagga gattacaagt gtcagccgcg cgctggggca ggaggacacg 13080 
ggcagcctgg aggcaaccct aaactacctg ctgaccaacc ggcggcagaa gatcccctcg 13140 
ttgcacagtt taaacagcga ggaggagcgc attttgcgct acgtgcagca gagcgtgagc 13200 
cttaacctga tgcgcgacgg ggtaacgccc agcgtggcgc tggacatgac cgcgcgcaac 13260 
atggaaccgg gcatgtatgc ctcaaaccgg ccgtttatca accgcctaat ggactacttg 13320 
catcgcgcgg ccgccgtgaa ccccgagtat ttcaccaatg ccatcttgaa cccgcactgg 13380 
ctaccgcccc ctggtttcta caccggggga ttcgaggtgc ccgagggtaa cgatggattc 134 4 0 
ctctgggacg acatagacga cagcgtgttt tccccgcaac cgcagaccct gctagagttg 13500 
caacagcgcg agcaggcaga ggcggcgctg cgaaaggaaa gcttccgcag gccaagcagc 13560 
ttgtccgatc taggcgctgc ggccccgcgg tcagatgcta gtagcccatt tccaagcttg 13620 
atagggtccc ttaccagcac tcgcaccacc cgcccgcgcc tgctgggcga ggaggagtac 13680 
ctaaacaact cgctgctgca gccgcagcgc gaaaaaaacc tgcctccggc atttcccaac 13740 
aacgggatag agagcctagt ggacaagatg agtagatgga agacgtacgc gcaggagcac 13800 
agggacgtgc caggcccgcg cccgcccacc cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
ctggtgtggg aggacgatga ctcggcagac gacagcagcg tcctggattt gggagggagt 13920 
ggcaacccgt ttgcgcacct tcgccccagg ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atgatgcaaa ataaaaaact caccaaggcc atggcaccga gcgttggttt tcttgtattc 14040 
cccttagtat gcggcgcgcg gcgatgtatg aggaaggtcc tcctccctcc tacgagagtg 14100 
tggtgagcgc ggcgccagtg gcggcggcgc tgggttctcc cttcgatgct cccctggacc 14160 
cgccgtttgt gcctccgcgg tacctgcggc ctaccggggg gagaaacagc atccgttact 14220 
ctgagttggc acccctattc gacaccaccc gtgtgtacct ggtggacaac aagtcaacgg 14280 
atgtggcatc cctgaactac cagaacgacc acagcaactt tctgaccacg gtcattcaaa 14340 
acaatgacta cagcccgggg gaggcaagca cacagaccat caatcttgac gaccggtcgc 14400 
actggggcgg cgacctgaaa accatcctgc ataccaacat gccaaatgtg aacgagttca 144 60 
tgtttaccaa taagtttaag gcgcgggtga tggtgtcgcg cttgcctact aaggacaatc 14520 
aggtggagct gaaatacgag tgggtggagt tcacgctgcc cgagggcaac tactccgaga 14580 
ccatgaccat agaccttatg aacaacgcga tcgtggagca ctacttgaaa gtgggcagac 14 640 
agaacggggt tctggaaagc gacatcgggg taaagtttga cacccgcaac ttcagactgg 14700 
ggtttgaccc cgtcactggt cttgtcatgc ctggggtata tacaaacgaa gccttccatc 14760 
cagacatcat tttgctgcca ggatgcgggg tggacttcac ccacagccgc ctgagcaact 14820 
tgttgggcat ccgcaagcgg caacccttcc aggagggctt taggatcacc tacgatgatc 14880 
tggagggtgg taacattccc gcactgttgg atgtggacgc ctaccaggcg agcttgaaag 14940 
atgacaccga acagggcggg ggtggcgcag gcggcagcaa cagcagtggc agcggcgcgg 15000 
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aagagaactc caacgcggca gccgcggcaa tgcagccggt ggaggacatg aacgatcatg 15060 
ccattcgcgg cgacaccttt gccacacggg ctgaggagaa gcgcgctgag gccgaagcag 15120 
cggccgaagc tgccgccccc gctgcgcaac ccgaggtcga gaagcctcag aagaaaccgg 15180 
tgatcaaacc cctgacagag gacagcaaga aacgcagtta caacctaata agcaatgaca 15240 
gcaccttcac ccagtaccgc agctggtacc ttgcatacaa ctacggcgac cctcagaccg 15300 
gaatccgctc atggaccctg ctttgcactc ctgacgtaac ctgcggctcg gagcaggtct 15360 
actggtcgtt gccagacatg atgcaagacc ccgtgacctt ccgctccacg cgccagatca 15420 
gcaactttcc ggtggtgggc gccgagctgt tgcccgtgca ctccaagagc ttctacaacg 15480 
accaggccgt ctactcccaa ctcatccgcc agtttacctc tctgacccac gtgttcaatc 15540 
gctttcccga gaaccagatt ttggcgcgcc cgccagcccc caccatcacc accgtcagtg 15600 
aaaacgttcc tgctctcaca gatcacggga cgctaccgct gcgcaacagc atcggaggag 15660 
tccagcgagt gaccattact gacgccagac gccgcacctg cccctacgtt tacaaggccc 15720 
tgggcatagt ctcgccgcgc gtcctatcga gccgcacttt ttgagcaagc atgtccatcc 15780 
ttatatcgcc cagcaataac acaggctggg gcctgcgctt cccaagcaag atgtttggcg 15840 
gggccaagaa gcgctccgac caacacccag tgcgcgtgcg cgggcactac cgcgcgccct 15900 
ggggcgcgca caaacgcggc cgcactgggc gcaccaccgt cgatgacgcc atcgacgcgg 15960 
tggtggagga ggcgcgcaac tacacgccca cgccgccacc agtgtccaca gtggacgcgg 16020 
ccattcagac cgtggtgcgc ggagcccggc gctatgctaa aatgaagaga cggcggaggc 16080 
gcgtagcacg tcgccaccgc cgccgacccg gcactgccgc ccaacgcgcg gcggcggccc 16140 
tgcttaaccg cgcacgtcgc accggccgac gggcggccat gcgggccgct cgaaggctgg 16200 
ccgcgggtat tgtcactgtg ccccccaggt ccaggcgacg agcggccgcc gcagcagccg 16260 
cggccattag tgctatgact cagggtcgca ggggcaacgt gtattgggtg cgcgactcgg 16320 
ttagcggcct gcgcgtgccc gtgcgcaccc gccccccgcg caactagatt gcaagaaaaa 16380 
actacttaga ctcgtactgt tgtatgtatc cagcggcggc ggcgcgcaac gaagctatgt 16440 
ccaagcgcaa aatcaaagaa gagatgctcc aggtcatcgc gccggagatc tatggccccc 16500 
cgaagaagga agagcaggat tacaagcccc gaaagctaaa gcgggtcaaa aagaaaaaga 16560 
aagatgatga tgatgaactt gacgacgagg tggaactgct gcacgctacc gcgcccaggc 16620 
gacgggtaca gtggaaaggt cgacgcgtaa aacgtgtttt gcgacccggc accaccgtag 16680 
tctttacgcc cggtgagcgc tccacccgca cctacaagcg cgtgtatgat gaggtgtacg 16740 
gcgacgagga cctgcttgag caggccaacg agcgcctcgg ggagtttgcc tacggaaagc 16800 
ggcataagga catgctggcg ttgccgctgg acgagggcaa cccaacacct agcctaaagc 16860 
ccgtaacact gcagcaggtg ctgcccgcgc ttgcaccgtc cgaagaaaag cgcggcctaa 16920 
agcgcgagtc tggtgacttg gcacccaccg tgcagctgat ggtacccaag cgccagcgac 16980 
tggaagatgt cttggaaaaa atgaccgtgg aacctgggct ggagcccgag gtccgcgtgc 17040 
ggccaatcaa gcaggtggcg ccgggactgg gcgtgcagac cgtggacgtt cagataccca 17100 
ctaccagtag caccagtatt gccaccgcca cagagggcat ggagacacaa acgtccccgg 17160 
ttgcctcagc ggtggcggat gccgcggtgc aggcggtcgc tgcggccgcg tccaagacct 17220 
ctacggaggt gcaaacggac ccgtggatgt ttcgcgtttc agccccccgg cgcccgcgcg 17280 
gttcgaggaa gtacggcgcc gccagcgcgc tactgcccga atatgcccta catccttcca 17340 
ttgcgcctac ccccggctat cgtggctaca cctaccgccc cagaagacga gcaactaccc 17400 
gacgccgaac caccactgga acccgccgcc gccgtcgccg tcgccagccc gtgctggccc 17460 
cgatttccgt gcgcagggtg gctcgcgaag gaggcaggac cctggtgctg ccaacagcgc 17520 
gctaccaccc cagcatcgtt taaaagccgg tctttgtggt tcttgcagat atggccctca 17580 
cctgccgcct ccgtttcccg gtgccgggat tccgaggaag aatgcaccgt aggaggggca 17640 
tggccggcca cggcctgacg ggcggcatgc gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 
cgcaccgtcg catgcgcggc ggtatcctgc ccctccttat tccactgatc gccgcggcga 17760 
ttggcgccgt gcccggaatt gcatccgtgg ccttgcaggc gcagagacac tgattaaaaa 17820 
caagttgcat gtggaaaaat caaaataaaa agtctggact ctcacgctcg cttggtcctg 17880 
taactatttt gtagaatgga agacatcaac tttgcgtctc tggccccgcg acacggctcg 17940 
cgcccgttca tgggaaactg gcaagatatc ggcaccagca atatgagcgg tggcgccttc 18000 
agctggggct cgctgtggag cggcattaaa aatttcggtt ccaccgttaa gaactatggc 18060 
agcaaggcct ggaacagcag cacaggccag atgctgaggg ataagttgaa agagcaaaat 18120 
ttccaacaaa aggtggtaga tggcctggcc tctggcatta gcggggtggt ggacctggcc 18180 
aaccaggcag tgcaaaataa gattaacagt aagcttgatc cccgccctcc cgtagaggag 18240 
cctccaccgg ccgtggagac agtgtctcca gaggggcgtg gcgaaaagcg tccgcgcccc 18300 
gacagggaag aaactctggt gacgcaaata gacgagcctc cctcgtacga ggaggcacta 18360 
aagcaaggcc tgcccaccac ccgtcccatc gcgcccatgg ctaccggagt gctgggccag 184 20 
cacacacccg taacgctgga cctgcctccc cccgccgaca cccagcagaa acctgtgctg 18480 
ccaggcccga ccgccgttgt tgtaacccgt cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agcggtccgc gatcgttgcg gcccgtagcc agtggcaact ggcaaagcac actgaacagc 18600 
atcgtgggtc tgggggtgca atccctgaag cgccgacgat gcttctgaat agctaacgtg 18660 
tcgtatgtgt gtcatgtatg cgtccatgtc gccgccagag gagctgctga gccgccgcgc 18720 
gcccgctttc caagatggct accccttcga tgatgccgca gtggtcttac atgcacatct 18780 
cgggccagga cgcctcggag tacctgagcc ccgggctggt gcagtttgcc cgcgccaccg 18840 
agacgtactt cagcctgaat aacaagttta gaaaccccac ggtggcgcct acgcacgacg 18900 
tgaccacaga ccggtcccag cgtttgacgc tgcggttcat ccctgtggac cgtgaggata 18960 
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ctgcgtactc gtacaaggcg cggttcaccc 
tggcttccac gtactttgac atccgcggcg 
actctggcac tgcctacaac gccctggctc 
atgaagctgc tactgctctt gaaataaacc 
aagtagacga gcaagctgag cagcaaaaaa 
gtataaatat tacaaaggag ggtattcaaa 
ccgataaaac atttcaacct gaacctcaaa 
ttaatcatgc agctgggaga gtccttaaaa 
catatgcaaa acccacaaat gaaaatggag 
gaaagctaga aagtcaagtg gaaatgcaat 
atggtgataa cttgactcct aaagtggtat 
cagacactca tatttcttac atgcccacta 
gccaacaatc tatgcccaac aggcctaatt 
taatgtatta caacagcacg ggtaatatgg 
atgctgttgt agatttgcaa gacagaaaca 
ccattggtga tagaaccagg tacttttcta 
cagatgttag aattattgaa aatcatggaa 
cactgggagg tgtgattaat acagagactc 
aaaatggatg ggaaaaagat gctacagaat 
ataattttgc catggaaatc aatctaaatg 
acatagcgct gtatttgccc gacaagctaa 
ataacccaaa cacctacgac tacatgaaca 
gctacattaa ccttggagca cgctggtccc 
accaccaccg caatgctggc ctgcgctacc 
tgcccttcca catccaggtg cctcagaagt 
cgggctcata cacctacgag tggaacttca 
ccctaggaaa tgacctaagg gttgacggag 
acgccacctt cttccccatg gcccacaaca 
acgacaccaa cgaccagtcc tttaacgact 
tacccgccaa cgctaccaac gtgcccatat 
gcggctgggc cttcacgcgc cttaagacta 
acccttatta cacctactct ggctctatac 
acacctttaa gaaggtggcc attacctttg 
gcctgcttac ccccaacgag tttgaaatta 
ttgcccagtg taacatgacc aaagactggt 
ttggctacca gggcttctat atcccagaga 
gaaacttcca gcccatgagc cgtcaggtgg 
aggtgggcat cctacaccaa cacaacaact 
tgcgcgaagg acaggcctac cctgctaact 
ttgacagcat tacccagaaa aagtttcttt 
ccagtaactt tatgtccatg ggcgcactca 
actccgccca cgcgctagac atgacttttg 
tttatgtttt gtttgaagtc tttgacgtgg 
tcgaaaccgt gtacctgcgc acgcccttct 
agcaacatca acaacagctg ccgccatggg 
caaagatctt ggttgtgggc catatttttt 
tgtttctcca cacaagctcg cctgcgccat 
cgtacactgg atggcctttg cctggaaccc 
ctttggcttt tctgaccagc gactcaagca 
gcgccgtagc gccattgctt cttcccccga 
aagcgtacag gggcccaact cggccgcctg 
ctttgccaac tggccccaaa ctcccatgga 
ggtacccaac tccatgctca acagtcccca 
acagctctac agcttcctgg agcgccactc 
taggagcgcc acttcttttt gtcacttgaa 
tttcaataaa ggcaaatgct tttatttgta 
tgccgtctgc gccgtttaaa aatcaaaggg 
cagggacacg ttgcgatact ggtgtttagt 
cggcagctcg gtgaagtttt cactccacag 
gtcgggcgcc gatatcttga agtcgcagtt 
atacacaggg ttgcagcact ggaacactat 
gctcttgtcg gagatcagat ccgcgtccag 
caactttggt agctgccttc ccaaaaaggg 
ccgtagtggc atcaaaaggt gaccgtgccc 
aaaagccttg atctgcttaa aagccacctg 
gcaagacttg ccggaaaact gattggccgg 



tagctgtggg tgataaccgt gtgctggaca 19020 
tgctggacag gggccctact tttaagccct 19080 
ccaagggtgc cccaaatcct tgcgaatggg 19140 
tagaagaaga ggacgatgac aacgaagacg 19200 
ctcacgtatt tgggcaggcg ccttattctg 19260 
taggtgtcga aggtcaaaca cctaaatatg 19320 
taggagaatc tcagtggtac gaaactgaaa 19380 
agactacccc aatgaaacca tgttacggtt 19440 
ggcaaggcat tcttgtaaag caacaaaatg 19500 
ttttctcaac tactgaggcg accgcaggca 19560 
tgtacagtga agatgtagat atagaaaccc 19620 
ttaaggaagg taactcacga gaactaatgg 19680 
acattgcttt tagggacaat tttattggtc 19740 
gtgttctggc gggccaagca tcgcagttga 19800 
cagagctttc ataccagctt ttgcttgatt 19860 
tgtggaatca ggctgttgac agctatgatc 19920 
ctgaagatga acttccaaat tactgctttc 19980 
ttaccaaggt aaaacctaaa acaggtcagg 20040 
tttcagataa aaatgaaata agagttggaa 20100 
ccaacctgtg gagaaatttc ctgtactcca 20160 
agtacagtcc ttccaacgta aaaatttctg 20220 
agcgagtggt ggctcccggg ttagtggact 20280 
ttgactatat ggacaacgtc aacccattta 20340 
gctcaatgtt gctgggcaat ggtcgctatg 20400 
tctttgccat taaaaacctc cttctcctgc 20460 
ggaaggatgt taacatggtt ctgcagagct 20520 
ccagcattaa gtttgatagc atttgccttt 20580 
ccgcctccac gcttgaggcc atgcttagaa 20640 
atctctccgc cgccaacatg ctctacccta 20700 
ccatcccctc ccgcaactgg gcggctttcc 20760 
aggaaacccc atcactgggc tcgggctacg 20820 
cctacctaga tggaaccttt tacctcaacc 20880 
actcttctgt cagctggcct ggcaatgacc 2094 0 
agcgctcagt tgacggggag ggttacaacg 21000 
tcctggtaca aatgctagct aactacaaca 21060 
gctacaagga ccgcatgtac tccttcttta 21120 
tggatgatac taaatacaag gactaccaac 21180 
ctggatttgt tggctacctt gcccccacca 21240 
tcccctatcc gcttataggc aagaccgcag 21300 
gcgatcgcac cctttggcgc atcccattct 21360 
cagacctggg ccaaaacctt ctctacgcca 21420 
aggtggatcc catggacgag cccacccttc 21480 
tccgtgtgca ccggccgcac cgcggcgtca 21540 
cggccggcaa cgccacaaca taaagaagca 21600 
ctccagtgag caggaactga aagccattgt 21660 
gggcacctat gacaagcgct ttccaggctt 21720 
agtcaatacg gccggtcgcg agactggggg 21780 
gcactcaaaa acatgctacc tctttgagcc 21840 
ggtttaccag tttgagtacg agtcactcct 21900 
ccgctgtata acgctggaaa agtccaccca 21960 
tggactattc tgctgcatgt ttctccacgc 22020 
tcacaacccc accatgaacc ttattaccgg 22080 
ggtacagccc accctgcgtc gcaaccagga 22140 
gccctacttc cgcagccaca gtgcgcagat 22200 
aaacatgtaa aaataatgta ctagagacac 22260 
cactctcggg tgattattta cccccaccct 22320 
gttctgccgc gcatcgctat gcgccactgg 22380 
gctccactta aactcaggca caaccatccg 22440 
gctgcgcacc atcaccaacg cgtttagcag 22500 
ggggcctccg ccctgcgcgc gcgagttgcg 22560 
cagcgccggg tggtgcacgc tggccagcac 22620 
gtcctccgcg ttgctcaggg cgaacggagt 22680 
cgcgtgccca ggctttgagt tgcactcgca 22740 
ggtctgggcg ttaggataca gcgcctgcat 22800 
agcctttgcg ccttcagaga agaacatgcc 22860 
acaggccgcg tcgtgcacgc agcaccttgc 22920 
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gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22980 

gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 

gtgctcctta tttatcataa tgcttccgtg tagacactta agctcgcctt cgatctcagc 23100 

gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 

aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 

ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 

cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 

gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 23400 

cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 23460 

ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 

cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 

cgccacatct tctctttctt cctcgctgtc cacgattacc tctggtgatg gcgggcgctc 23640 

gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 23700 

ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 237 60 

gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 

cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 

gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 

gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24 000 

cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 

cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 2 4120 

cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 24180 

cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 

cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24300 

cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 24360 

accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24420 

cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 

ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 

gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24600 

gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24660 

tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 

aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24780 

catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 

aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24900 

ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24960 

agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 

gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 

caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 25140 

ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 

ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 

gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 

ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 

ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 254 4 0 

gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 

tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 

ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 

cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 

ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 25740 

gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 

gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 

gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 

gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 

aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 26040 

gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 

agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 

aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 26220 

aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 26280 

cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 26340 

cgccgccggc actgcccgtt cgccgaccca accgtagatg ggacaccact ggaaccaggg 26400 

ccggtaagtc caagcagccg ccgccgttag cccaagagca acaacagcgc caaggctacc 26460 

gctcatggcg cgggcacaag aacgccatag ttgcttgctt gcaagactgt gggggcaaca 26520 

tctccttcgc ccgccgcttt cttctctacc atcacggcgt ggccttcccc cgtaacatcc 26580 

tgcattacta ccgtcatctc tacagcccat actgcaccgg cggcagcggc agcggcagca 26640 

acagcagcgg ccacacagaa gcaaaggcga ccggatagca agactctgac aaagcccaag 26700 

aaatccacag cggcggcagc agcaggagga ggagcgctgc gtctggcgcc caacgaaccc 26760 

gtatcgaccc gcgagcttag aaacaggatt tttcccactc tgtatgctat atttcaacag 26820 

agcaggggcc aagaacaaga gctgaaaata aaaaacaggt ctctgcgatc cctcacccgc 26880 
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agctgcctgt atcacaaaag cgaagatcag 
ctcttcagta aatactgcgc gctgactctt 
taagcgcgaa aactacgtca tctccagcgg 
gccattatga gcaaggaaat tcccacgccc 
cttgcggctg gagctgccca agactactca 
cacatgatat cccgggtcaa cggaatccgc 
gcggctatta ccaccacacc tcgtaataac 
gtgtaccagg aaagtcccgc tcccaccact 
gttcagatga ctaactcagg ggcgcagctt 
cccgggcagg gtataactca cctgacaatc 
tcggtgagct cctcgcttgg tctccgtccg 
cgtccttcat tcacgcctcg tcaggcaatc 
cgctctggag gcattggaac tctgcaattt 
aaccccttct cgggacctcc cggccactat 
gtaaaggact cggcggacgg ctacgactga 
ctgaaacacc tggtccactg tcgccgccac 
tgctactttg aattgcccga ggatcatatc 
gcccagggag agcttgcccg tagcctgatt 
gagcgggaca ggggaccctg tgttctcact 
catcaagatc tttgttgcca tctctgtgct 
ctggggctcc tatcgccatc ctgtaaacgc 
gcgaacctta cctggtactt ttaacatctc 
agacggagtg agtctacgag agaacctctc 
caccctcctt acctgccggg aacgtacgag 
gcctgaccgt aaaccagact ttttccggac 
gaggtgagct tagaaaaccc ttagggtatt 
tgaacaattc aagcaactct acgggctatt 
ttctctgtct tgtgattctc tttattctta 
cctgctgtgt gcacatttgc atttattgtc 
gatgattagg tacataatcc taggtttact 
aacccaaaag gtggatttta aggagccagc 
tgagtgcacc actcttataa aatgcaccac 
aaacaaaatt ggcaagtatg ctgtttatgc 
taatgttaca gttttccagg gtaaaagtca 
tgaaatgtgc gacattacca tgtacatgag 
ttgtgtggaa aacactggca ctttctgctg 
ggtctgtacc ctactctata ttaaatacaa 
aatgccttaa tttactaagt tacaaagcta 
tgcaaaacaa attcaaaaag ttagcattat 
tttcctgctc aataccattc ccctgaacaa 
acaaccttga agtcaggctt cctggatgtc 
cggatttgtt ccagtccaac tacagcgacc 
cgcggccgcc gctaccggac ttacatctac 
caataactgg gataacttgg gcatgtggtg 
tattattatg tggctcatct gctgcctaaa 
tcccatcatt gtgctacacc caaacaatga 
catgttcttt tctcttacag tatgattaaa 
ctgacccttg ttgcgctttt ttgtgcgtgc 
gtagactgca ttccagcctt cacagtctat 
atctgcagcc tcatcactgt ggtcatcgcc 
cgctttgcat atctcagctg ctgccatgtt 
ctgccatgct cttgtcgcct tagatctctc 
gtgatgtgtg ttttgtccta tatattttaa 
gcctttgcgt tctggtaggc cgtcattgaa 
gtaaaaaata atggttgaga ctcagcccaa 
cccaacatcg gcagatgagg tggattgaga 
ttagatgagg tggattgaga ctcatgaggg 
agttgtgctt tacagtccaa cgtgcaggac 
ggcaaattcg caactggcgc cctgtgctct 
tatttttgta aaacggaaac cgaatgtccg 
tctgccctgc cgctgggccg acgccgtccg 
cattcactca ccttgtcaag gacctgagga 
tctcaaagat cttattccct ttaactaata 
aatcagttag caaatttctg tccagtttat 
tctggtattg cagcttcctc ctggctgcaa 
tttcctcctg ttcctgtcca tccgcaccca 



cttcggcgca cgctggaaga cgcggaggct 26940 
aaggactagt ttcgcgccct ttctcaaatt 27000 
ccacacccgg cgccagcacc tgtcgtcagc 27060 
tacatgtgga gttaccagcc acaaatggga 27120 
acccgaataa actacatgag cgcgggaccc 27180 
gcccaccgaa accgaattct cttggaacag 27240 
cttaatcccc gtagttggcc cgctgccctg 27300 
gtggtacttc ccagagacgc ccaggccgaa 27360 
gcgggcggct ttcgtcacag ggtgcggtcg 27420 
agagggcgag gtattcagct caacgacgag 27480 
gacgggacat ttcagatcgg cggcgccggc 2754 0 
ctaactctgc agacctcgtc ctctgagccg 27600 
attgaggagt ttgtgccatc ggtctacttt 27660 
ccggatcaat ttattcctaa ctttgacgcg 27720 
atgttaagtg gagaggcaga gcaactgcgc 27780 
aagtgctttg cccgcgactc cggtgagttt 27840 
gagggcccgg cgcacggcgt ccggcttacc 27900 
cgggagttta cccagcgccc cctgctagtt 27960 
gtgatttgca actgtcctaa ccttggatta 28020 
gagtataata aatacagaaa ttaaaatata 28080 
caccgtcttc acccgcccaa gcaaaccaag 28140 
tccctctgtg atttacaaca gtttcaaccc 28200 
cgagctcagc tactccatca gaaaaaacac 28260 
tgcgtcaccg gccgctgcac cacacctacc 28320 
agacctcaat aactctgttt accagaacag 28380 
aggccaaagg cgcagctact gtggggttta 28440 
ctaattcagg tttctctagg gttggggtta 28500 
tactaacgct tctctgccta aggctcgccg 28560 
agctttttaa acgctggggt cgccacccaa 28620 
cacccttgcg tcagcccacg gtacttaatt 28680 
ctgtaatgtt acattcgcag ctgaagctaa 28740 
agaacatgaa aagctgctta ttcgccacaa 28B00 
tatttggcag ccaggtgaca ctacagagta 28860 
taaaactttt atgtatactt ttccatttta 28920 
caaacagtat aagttgtggc ccccacaaaa 28980 
cactgctatg ctaattacag tgctcgcttt 29040 
aagcagacgc agctttattg aggaaaagaa 29100 
atgtcaccac taactgcttt actcgctgct 29160 
aattagaata ggatttaaac cccccggtca 29220 
ttgactctat gtgggatatg ctccagcgct 29280 
agcatctgac tttggccagc acctgtcccg 29340 
caccctaaca gagatgacca acacaaccaa 29400 
cacaaataca ccccaagttt ctgcctttgt 29460 
gttctccata gcgcttatgt ttgtatgcct 29520 
gcgcaaacgc gcccgaccac ccatctatag 29580 
tggaatccat agattggacg gactgaaaca 29640 
tgagacatga ttcctcgagt ttttatatta 29700 
tccacattgg ctgcggtttc tcacatcgaa 2 9760 
ttgctttacg gatttgtcac cctcacgctc 29820 
tttatccagt gcattgactg ggtctgtgtg 29880 
gtgttgctac catgttgttt tcatgtgttg 29940 
tttatgtagt gttgtggtgt ctctcttgtc 30000 
tttttaatcc aaacccctgt ccccgcagag 30060 
aactgactta actcgttaaa ttaaaaaaat 30120 
catcggcaga tgaggtggat tgagactcag 30180 
ctcaacccca acattggcag atgaggtgaa 30240 
tggtatgagg gcccgacgtc cacaggtggg 30300 
gcttggcatt tgccagagaa caccaagatt 30360 
tcacagacgg aaaaatgacc aaaatctgat 30420 
acaaagttca tttgatgact tcccggtagg 30480 
ggaattttac aaacgatttc ggacgtctag 30540 
tctctgcacc cttattaaga ccctgtgcgg 30600 
aaaaaaaata ataaagcatc acttacttaa 30660 
tcagcagcac ctccttgccc tcctcccagc 30720 
actttctcca caatctaaat ggaatgtcag 30780 
ctatcttcat gttgttgcag atgaagcgcg 30840 
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caagaccgtc tgaagatacc ttcaaccccg 
caactgtgcc ttttcttact cctccctttg 
ctggggtact ctctttgcgc ctatccgaac 
tcaaaatggg caacggcccc tctctggacg 
ccactgtgag cccacctctc aaaaaaacca 
ccctcacagt tacctcagaa gccctaactg 
gcaacacact caccatgcaa tcacaggccc 
ttgccaccca aggacccctc acagtgtcag 
ccctcaccac caccgatagc agtaccctta 
ccactggtag cttgggcatt gacttgaaag 
gactaaagta cggggctcct ttgcatgtaa 
ctggtccagg tgtgactatt aataatactt 
gttttgattc acaaggcaat atgcaactta 
aaaacagacg ccttatactt gatgttagtt 
taagactagg acagggccct ctttttataa 
acaaaggcct ttacttgttt acagcttcaa 
gcactgccaa ggggttgatg tttgacgcta 
ttgaatttgg ttcacctaat gcaccaaaca 
gcctagaatt tgattcaaac aaggctatgg 
acagcacagg tgccattaca gtaggaaaca 
caccagctcc atctcctaac tgtagactaa 
tcttaacaaa atgtggcagt caaatacttg 
gtttggctcc aatatctgga acagttcaaa 
atggagtgct actaaacaat tccttcctgg 
atcttactga aggcacagcc tatacaaacg 
atccaaaatc tcacggtaaa actgccaaaa 
gagacaaaac taaacctgta acactaacca 
acacaactcc aagtgcatac tctatgtcat 
ttaatgaaat atttgccaca tcctcttaca 
tcgtttgtgt tatgtttcaa cgtgtttatt 
tcattcagta gtatagcccc accaccacat 
actcacagaa ccctagtatt caacctgcca 
ctttctcccc ggctggcctt aaaaagcatc 
gttatattcc acacggtttc ctgtcgagcc 
ccgggcagct cacttaagtt catgtcgctg 
acttgcggtt gcttaacggg cggcgaagga 
taatcgtgca tcaggatagg gcggtggtgc 
cgccgctccg tcctgcagga atacaacatg 
gcccgcagca taaggcgcct tgtcctccgg 
tcagcacagt aactgcagca cagcaccaca 
ctgtatccaa agctcatggc ggggaccaca 
aggtagatta agtggcgacc cctcataaac 
atgttgtaat tcaccacctc ccggtaccat 
accaccatcc taaaccagct ggccaaaacc 
ggactggaac aatgacagtg gagagcccag 
atgatatcaa tgttggcaca acacaggcac 
tcctcccgcg ttagaaccat atcccaggga 
acactgcagg gaagacctcg cacgtaactc 
ggcagcagcg gatgatcctc cagtatggta 
cgatccctac tgtacggagt gcgccgagac 
ccaaatggaa cgccggacgt agtcatattt 
aacagatctg cgtctccggt ctcgccgctt 
ccactctctc aaagcatcca ggcgccccct 
cgccgctgcc ctgataacat ccaccaccgc 
ttcgttctgc gagtcacaca cgggaggagc 
tttattccaa aagattatcc aaaacctcaa 
ctccggtggc gtggtcaaac tctacagcca 
gcacaatggc ttccaaaagg caaacggccc 
cttcagggtg aatctcctct ataaacattc 
catctcgcca ccttctcaat atatctctaa 
taaaaatctg ctccagagcg ccctccacct 
aaattcaggt tcctcacaga cctgtataag 
gcgatcccgt aggtcccttc gcagggccag 
cagcgcggcc acttccccgc caggaacctt 
catactcgga gctatgctaa ccagcgtagc 
atataaaatg caaggtgctg ctcaaaaaat 



tgtatccata tgacacggaa accggtcctc 30900 
tatcccccaa tgggtttcaa gagagtcccc 30960 
ctctagttac ctccaatggc atgcttgcgc 31020 
aggccggcaa ccttacctcc caaaatgtaa 31080 
agtcaaacat aaacctggaa atatctgcac 31140 
tggctgccgc cgcacctcta atggtcgcgg 31200 
cgctaaccgt gcacgactcc aaacttagca 31260 
aaggaaagct agccctgcaa acatcaggcc 31320 
ctatcactgc ctcaccccct ctaactactg 31380 
agcccattta tacacaaaat ggaaaactag 31440 
cagacgacct aaacactttg accgtagcaa 31500 
ccttgcaaac taaagttact ggagccttgg 31560 
atgtagcagg aggactaagg attgattctc 31620 
atccgtttga tgctcaaaac caactaaatc 31680 
actcagccca caacttggat attaactaca 31740 
acaattccaa aaagcttgag gttaacctaa 31800 
cagccatagc cattaatgca ggagatgggc 31860 
caaatcccct caaaacaaaa attggccatg 31920 
ttcctaaact aggaactggc cttagttttg 31980 
aaaataatga taagctaact ttgtggacca 32040 
atgcagagaa agatgctaaa ctcactttgg 32100 
ctacagtttc agttttggct gttaaaggca 32160 
gtgctcatct tattataaga tttgacgaaa 32220 
acccagaata ttggaacttt agaaatggag 32280 
ctgttggatt tatgcctaac ctatcagctt 32340 
gtaacattgt cagtcaagtt tacttaaacg 32400 
ttacactaaa cggtacacag gaaacaggag 324 60 
tttcatggga ctggtctggc cacaactaca 32520 
ctttttcata cattgcccaa gaataaagaa 32580 
tttcaattgc agaaaatttc aagtcatttt 32640 
agcttataca gatcaccgta ccttaatcaa 32700 
cctccctccc aacacacaga gtacacagtc 32760 
atatcatggg taacagacat attcttaggt 32820 
aaacgctcat cagtgatatt aataaactcc 32880 
tccagctgct gagccacagg ctgctgtcca 32940 
gaagtccacg cctacatggg ggtagagtca 33000 
tgcagcagcg cgcgaataaa ctgctgccgc 33060 
gcagtggtct cctcagcgat gattcgcacc 33120 
gcacagcagc gcaccctgat ctcacttaaa 33180 
atattgttca aaatcccaca gtgcaaggcg 33240 
gaacccacgt ggccatcata ccacaagcgc 33300 
acgctggaca taaacattac ctcttttggc 33360 
ataaacctct gattaaacat ggcgccatcc 33420 
tgcccgccgg ctatacactg cagggaaccg 33480 
gactcgtaac catggatcat cdtgctcgtc 33540 
acgtgcatac acttcctcag gattacaagc 3360O 
acaacccatt cctgaatcag cgtaaatccc 33660 
acgttgtgca ttgtcaaagt gttacattcg 33720 
gcgcgggttt ctgtctcaaa aggaggtaga 33780 
aaccgagatc gtgttggtcg tagtgtcatg 33840 
cctgaagcaa aaccaggtgc gggcgtgaca 33900 
agatcgctct gtgtagtagt tgtagtatat 33960 
ggcttcgggt tctatgtaaa ctccttcatg 34020 
agaataagcc acacccagcc aacctacaca 34 080 
gggaagagct ggaagaacca tgtttttttt 34140 
aatgaagatc tattaagtga acgcgctccc 34200 
aagaacagat aatggcattt gtaagatgtt 34260 
tcacgtccaa gtggacgtaa aggctaaacc 34320 
cagcaccttc aaccatgccc aaataattct 34380 
gcaaatcccg aatattaagt ccggccattg 34440 
tcagcctcaa gcagcgaatc atgattgcaa 34500 
attcaaaagc ggaacattaa caaaaatacc 34560 
ctgaacataa tcgtgcaggt ctgcacggac 34620 
gacaaaagaa cccacactga ttatgacacg 34680 
cccgatgtaa gctttgttgc atgggcggcg 34740 
caggcaaagc ctcgcgcaaa aaagaaagca 34800 
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catcgtagtc atgctcatgc agataaaggc 
acaccatttt tctctcaaac atgtctgcgg 
aaaaaacatt taaacattag aagcctgtct 
aagacggact acggccatgc cggcgtgacc 
caccaccgac agctcctcgg tcatgtccgg 
aggttgattc atcggtcagt gctaaaaagc 
gcaggcgtag agacaacatt acagccccca 
aaaacacata aacacctgaa aaaccctcct 
gaacaacata cagcgcttca cagcggcagc 
aacctattaa aaaaacacca ctcgacacgg 
gggccaagtg cagagcgagt atatatagga 
aaaaaacacc cagaaaaccg cacgcgaacc 
caacttcctc aaatcgtcac ttccgttttc 
actacaattc ccaacacata caagttactc 
cccacgcccc gcgccacgtc acaaactcca 
aaataaggta tattattgat gatg 



aggtaagctc cggaaccacc acagaaaaag 34860 
gtttctgcat aaacacaaaa taaaataaca 34920 
tacaacagga aaaacaaccc ttataagcat 34980 
gtaaaaaaac tggtcaccgt gattaaaaag 35040 
agtcataatg taagactcgg taaacacatc 35100 
gaccgaaata gcccggggga atacataccc 35160 
taggaggtat aacaaaatta ataggagaga 35220 
gcctaggcaa aatagcaccc tcccgctcca 35280 
ctaacagtca gccttaccag taaaaaagaa 35340 
caccagctca atcagtcaca gtgtaaaaaa 35400 
ctaaaaaatg acgtaacggt taaagtccac 35460 
tacgcccaga aacgaaagcc aaaaaaccca 35520 
ccacgttacg taacttccca ttttaagaaa 35580 
cgccctaaaa cctacgtcac ccgccccgtt 35640 
ccccctcatt atcatattgg cttcaatcca 35700 

35724 



<210> 14 
<211> 33988 
<212> DNA 

<213> Adenovirus subgroup C 



<400> 14 

catcatcaat aatatacctt attttggatt 
ttgtgacgtg gcgcggggcg tgggaacggg 
gatgttgcaa gtgtggcgga acacatgtaa 
gtgtgcgccg gtgtacacag gaagtgacaa 
taaatttggg cgtaaccgag taagatttgg 
agtgaaatct gaataatttt gtgttactca 
gactttgacc gtttacgtgg agactcgccc 
cgggtcaaag ttggcgtttt attattatag 
tgagttcctc aagaggccac tcttgagtgc 
tccgacaccg ggactgaaaa tgagacatga 
ccattttgaa ccacctaccc ttcacgaact 
tcccaacgag gaggcggttt cgcagatttt 
agggattgac ttactcactt ttccgccggc 
ccggcagccc gagcagccgg agcagagagc 
tccacccagt gacgacgagg atgaagaggg 
ccccgggcac ggttgcaggt cttgtcatta 
tatgtgttcg ctttgctata tgaggacctg 
atgggcagtg ggtgatagag tggtgggttt 
gttttgtggt ttaaagaatt ttgtattgtg 
gagcctgagc ccgagccaga accggagcct 
cctgctatcc tgagacgccc gacatcacct 
agctgtgact ccggtccttc taacacacct 
cccattaaac cagttgccgt gagagttggt 
gacttgctta acgagcctgg gcaacctttg 
ggtgtaaacc tgtgattgcg tgtgtggtta 
agtttaataa agggtgagat aatgtttaac 
aaagggtata taatgcgccg tgggctaatc 
gagtgtttgg aagatttttc tgctgtgcgt 
tcttggtttt ggaggtttct gtggggctca 
gaggattaca agtgggaatt tgaagagctt 
ttgaatctgg gtcaccaggc gcttttccaa 
acaccggggc gcgctgcggc tgctgttgct 
gaagaaaccc atctgagcgg ggggtacctg 
gcggttgtga gacacaagaa tcgcctgcta 
ccgacggagg agcagcagca gcagcaggag 
ccatggaacc cgagagccgg cctggaccct 
tgtatccaga actgagacgc attttgacaa 
taaagaggga gcggggggct tgtgaggcta 
taatgaccag acaccgtcct gagtgtatta 
atgagcttga tctgctggcg cagaagtatt 
agccagggga tgattttgag gaggctatta 
attgcaagta caagatcagc aaacttgtaa 
acggggccga ggtggagata gatacggagg 



gaagccaata tgataatgag ggggtggagt 60 
gcgggtgacg tagtagtgtg gcggaagtgt 120 
gcgacggatg tggcaaaagt gacgtttttg 180 
ttttcgcgcg gttttaggcg gatgttgtag 240 
ccattttcgc gggaaaactg aataagagga 300 
tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 420 
tcagctgacg tgtagtgtat ttatacccgg 480 
cagcgagtag agttttctcc tccgagccgc 540 
ggtactggct gataatcttc cacctcctag 600 
gtatgattta gacgtgacgg cccccgaaga 660 
tcccgactct gtaatgttgg cggtgcagga 720 
gcccggttct ccggagccgc ctcacctttc 780 
cttgggtccg gtttgccacg aggctggctt 840 
tgaggagttt gtgttagatt atgtggagca 900 
tcaccggagg aatacggggg acccagatat 960 
tggcatgttt gtctacagta agtgaaaatt 1020 
ggtgtggtaa tttttttttt aatttttaca 1080 
atttttttaa aaggtcctgt gtctgaacct 1140 
gcaagaccta cccgccgtcc taaaatggcg 1200 
gtgtctagag aatgcaatag tagtacggat 1260 
cctgagatac acccggtggt cccgctgtgc 1320 
gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgagct gtaaacgccc caggccataa 14 40 
acgcctttgt ttgctgaatg agttgatgta 1500 
ttgcatggcg tgttaaatgg ggcggggctt 1560 
ttggttacat ctgacctcat ggaggcttgg 1620 
aacttgctgg aacagagctc taacagtacc 1680 
tcccaggcaa agttagtctg cagaattaag 17 4 0 
ttgaaatcct gtggtgagct gtttgattct 1800 
gagaaggtca tcaagacttt ggatttttcc 1860 
tttttgagtt ttataaagga taaatggagc 1920 
ctggattttc tggccatgca tctgtggaga 1980 
ctgttgtctt ccgtccgccc ggcgataata 2040 
gaagccaggc ggcggcggca ggagcagagc 2100 
cgggaatgaa tgttgtacag gtggctgaac 2160 
ttacagagga tgggcagggg ctaaaggggg 2220 
cagaggaggc taggaatcta gcttttagct 2280 
cttttcaaca gatcaaggat aattgcgcta 234 0 
ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 2460 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
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atatgtggcc gggggtgctt ggcatggacg gggtggttat tatgaatgta aggtttactg 2640 
gccccaattt tagcggtacg gttttcctgg ccaataccaa ccttatccta cacggtgtaa 2700 
gcttctatgg gtttaacaat acctgtgtgg aagcctggac cgatgtaagg gttcggggct 2760 
gtgcctttta ctgctgctgg aagggggtgg tgtgtcgccc caaaagcagg gcttcaatta 2820 
agaaatgcct ctttgaaagg tgtaccttgg gtatcctgtc tgagggtaac tccagggtgc 2880 
gccacaatgt ggcctccgac tgtggttgct tcatgctagt gaaaagcgtg gctgtgatta 2940 
agcataacat ggtatgtggc aactgcgagg acagggcctc tcagatgctg acctgctcgg 3000 
acggcaactg tcacctgctg aagaccattc acgtagccag ccactctcgc aaggcctggc 3060 
cagtgtttga gcataacata ctgacccgct gttccttgca tttgggtaac aggagggggg 3120 
tgttcctacc ttaccaatgc aatttgagtc acactaagat attgcttgag cccgagagca 3180 
tgtccaaggt gaacctgaac ggggtgtttg acatgaccat gaagatctgg aaggtgctga 3240 
ggtacgatga gacccgcacc aggtgcagac cctgcgagtg tggcggtaaa catattagga 3300 
accagcctgt gatgctggat gtgaccgagg agctgaggcc cgatcacttg gtgctggcct 3360 
gcacccgcgc tgagtttggc tctagcgatg aagatacaga ttgaggtact gaaatgtgtg 3420 
ggcgtggctt aagggtggga aagaatatat aaggtggggg tcttatgtag ttttgtatct 3480 
gttttgcagc agccgccgcc gccatgagca ccaactcgtt tgatggaagc attgtgagct 3540 
catatttgac aacgcgcatg cccccatggg ccggggtgcg tcagaatgtg atgggctcca 3600 
gcattgatgg tcgccccgtc ctgcccgcaa actctactac cttgacctac gagaccgtgt 3660 
ctggaacgcc gttggagact gcagcctccg ccgccgcttc agccgctgca gccaccgccc 3720 
gcgggattgt gactgacttt gctttcctga gcccgcttgc aagcagtgca gcttcccgtt 3780 
catccgcccg cgatgacaag ttgacggctc ttttggcaca attggattct ttgacccggg 3840 
aacttaatgt cgtttctcag cagctgttgg atctgcgcca gcaggtttct gccctgaagg 3900 
cttcctcccc tcccaatgcg gtttaaaaca taaataaaaa accagactct gtttggattt 3960 
ggatcaagca agtgtcttgc tgtctttatt taggggtttt gcgcgcgcgg taggcccggg 4020 
accagcggtc tcggtcgttg agggtcctgt gtattttttc caggacgtgg taaaggtgac 4080 
tctggatgtt cagatacatg ggcataagcc cgtctctggg gtggaggtag caccactgca 4140 
gagcttcatg ctgcggggtg gtgttgtaga tgatccagtc gtagcaggag cgctgggcgt 4200 
ggtgcctaaa aatgtctttc agtagcaagc tgattgccag gggcaggccc ttggtgtaag 4260 
tgtttacaaa gcggttaagc tgggatgggt gcatacgtgg ggatatgaga tgcatcttgg 4320 
actgtatttt taggttggct atgttcccag ccatatccct ccggggattc atgttgtgca 4380 
gaaccaccag cacagtgtat ccggtgcact tgggaaattt gtcatgtagc ttagaaggaa 4 440 
atgcgtggaa gaacttggag acgcccttgt gacctccaag attttccatg cattcgtcca 4500 
taatgatggc aatgggccca cgggcggcgg cctgggcgaa gatatttctg ggatcactaa 4 560 
cgtcatagtt gtgttccagg atgagatcgt cataggccat ttttacaaag cgcgggcgga 4 620 
gggtgccaga ctgcggtata atggttccat ccggcccagg ggcgtagtta ccctcacaga 4 680 
tttgcatttc ccacgctttg agttcagatg gggggatcat gtctacctgc ggggcgatga 4740 
agaaaacggt ttccggggta ggggagatca gctgggaaga aagcaggttc ctgagcagct 4800 
gcgacttacc gcagccggtg ggcccgtaaa tcacacctat taccgggtgc aactggtagt 4860 
taagagagct gcagctgccg tcatccctga gcaggggggc cacttcgtta agcatgtccc 4 920 
tgactcgcat gttttccctg accaaatccg ccagaaggcg ctcgccgccc agcgatagca 4 980 
gttcttgcaa ggaagcaaag tttttcaacg gtttgagacc gtccgccgta ggcatgcttt 504 0 
tgagcgtttg accaagcagt tccaggcggt cccacagctc ggtcacctgc tctacggcat 5100 
ctcgatccag catatctcct cgtttcgcgg gttggggcgg ctttcgctgt acggcagtag 5160 
tcggtgctcg tccagacggg ccagggtcat gtctttccac gggcgcaggg tcctcgtcag 5220 
cgtagtctgg gtcacggtga aggggtgcgc tccgggctgc gcgctggcca gggtgcgctt 5280 
gaggctggtc ctgctggtgc tgaagcgctg ccggtcttcg ccctgcgcgt cggccaggta 5340 
gcatttgacc atggtgtcat agtccagccc ctccgcggcg tggcccttgg cgcgcagctt 5400 
gcccttggag gaggcgccgc acgaggggca gtgcagactt ttgagggcgt agagcttggg 54 60 
cgcgagaaat accgattccg gggagtaggc atccgcgccg caggccccgc agacggtctc 5520 
gcattccacg agccaggtga gctctggccg ttcggggtca aaaaccaggt ttcccccatg 5580 
ctttttgatg cgtttcttac ctctggtttc catgagccgg tgtccacgct cggtgacgaa 5640 
aaggctgtcc gtgtccccgt atacagactt gagaggcctg tcctcgagcg gtgttccgcg 5700 
gtcctcctcg tatagaaact cggaccactc tgagacaaag gctcgcgtcc aggccagcac 5760 
gaaggaggct aagtgggagg ggtagcggtc gttgtccact agggggtcca ctcgctccag 5820 
ggtgtgaaga cacatgtcgc cctcttcggc atcaaggaag gtgattggtt tgtaggtgta 5880 
ggccacgtga ccgggtgttc ctgaaggggg gctataaaag ggggtggggg cgcgttcgtc 5940 
ctcactctct tccgcatcgc tgtctgcgag ggccagctgt tggggtgagt actccctctg 6000 
aaaagcgggc atgacttctg cgctaagatt gtcagtttcc aaaaacgagg aggatttgat 6060 
attcacctgg cccgcggtga tgcctttgag ggtggccgca tccatctggt cagaaaagac 6120 
aatctttttg ttgtcaagct tggtggcaaa cgacccgtag agggcgttgg acagcaactt 6180 
ggcgatggag cgcagggttt ggtttttgtc gcgatcggcg cgctccttgg ccgcgatgtt 6240 
tagctgcacg tattcgcgcg caacgcaccg ccattcggga aagacggtgg tgcgctcgtc 6300 
gggcaccagg tgcacgcgcc aaccgcggtt gtgcagggtg acaaggtcaa cgctggtggc 6360 
tacctctccg cgtaggcgct cgttggtcca gcagaggcgg ccgcccttgc gcgagcagaa 6420 
tggcggtagg gggtctagct gcgtctcgtc cggggggtct gcgtccacgg taaagacccc 6480 
gggcagcagg cgcgcgtcga agtagtctat cttgcatcct tgcaagtcta gcgcctgctg 6540 
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ccatgcgcgg gcggcaagcg cgcgctcgta 
gtgggtgagc gcggaggcgt acatgccgca 
tattccaaga tatgtagggt agcatcttcc 
tagttcgtgc gagggagcga ggaggtcggg 
tcggaagact atctgcctga agatggcatg 
gacgttgaag ctggcgtctg tgagacctac 
gcgcagcttg ttgaccagct cggcggtgac 
ttccttgatg atgtcatact tatcctgtcc 
aaactcttcg cggtctttcc agtactcttg 
agagcctagc atgtagaact ggttgacggc 
tagcgcgtat gcctgcgcgg ccttccggag 
gaccatgact ttgaggtact ggtatttgaa 
gagcaaaaag tccgtgcgct ttttggaacg 
gaagagtarc tttcccgcgc gaggcataaa 
ctcggaacgg ttgttaatta cctgggcggc 
gtggcccaca atgtaaagtt ccaagaagcg 
aagttcctcg taggtgagct cttcagggga 
tgcaagatga gggttggaag cgacgaatga 
caggtggtcg cgaaaggtcc taaactggcg 
gtagaaggta agcgggtctt gttcccagcg 
cgcggcagtc actagaggct catctccgcc 
ctgcttccca aaggccccca tccaagtata 
ctcggtgcga ggatgcgagc cgatcgggaa 
gtggctattg atgtggtgaa agtagaagtc 
tttgtaaaaa cgtgcgcagt actggcagcg 
gacctgacga ccgcgcacaa ggaagcagag 
tggctggtgg tcttctactt cggctgcttg 
tacggtggat cggaccacca cgccgcgcga 
tcggagcttg atgacaacat cgcgcagatg 
cgtcaggtca ggcgggagct cctgcaggtt 
tagatccagg tgatacctaa tttccagggg 
gaggccgcat ccccgcggcg cgactacggt 
gtccttggat gatgcatcta aaagcggtga 
tccggacccg ccgggagagg gggcaggggc 
tgctgcgcgc gtaggttgct ggcgaacgcg 
cgcctctgcg tgaagacgac gggcccggtg 
tcaatttcgg tgtcgttgac ggcggcctgg 
tcttgatagg cgatctcggc catgaactgc 
ccggctcgct ccacggtggc ggcgaggtcg 
gcgttgaggc ctccctcgtt ccagacgcgg 
gcgcgcatga ccacctgcgc gagattgagc 
cgcaggcgct gaaagaggta gttgagggtg 
ataacccagc gtcgcaacgt ggattcgttg 
gcctcgtaga agtccacggc gaagttgaaa 
tcctcctcca gaagacggat gagctcggcg 
acaggggcct cttcttcttc ttcaatctcc 
tctggcggcg gtgggggagg ggggacacgg 
acaaagcgct cgatcatctc cccgcggcga 
ttctcgcggg ggcgcagttg gaagacgccg 
gggctgccat gcggcaggga tacggcgcta 
actccgccgc cgagggacct gagcgagtcc 
aaggcgtcta accagtcaca gtcgcaaggt 
cggcggtcgg ggttgtttct ggcggaggtg 
ttgagacggc ggatggtcga cagaagcacc 
aggcggtcgg ccatgcccca ggcttcgttt 
tgcatgagcc tttctaccgg cacttcttct 
tctatcgctg cggcggcggc ggagtttggc 
gtgaccccga agcccctcat cggctgaagc 
aatatggcct gctgcacctg cgtgagggta 
tggtatgcgc ccgtgttgat ggtgtaagtg 
tggtgacccg gctgcgagag ctcggtgtac 
acgtagtcgt tgcaagtccg caccaggtac 
tggcggtaga ggggccagcg tagggtggcc 
aggcgatgat atccgtagat gtacctggac 
gcgcgcggaa agtcgcggac gcggttccag 
gtcgggacgc tctggccggt caggcgcgcg 



tgggttgagt gggggacccc atggcatggg 6600 
aatgtcgtaa acgtagaggg gctctctgag 6660 
accgcggatg ctggcgcgca cgtaatcgta 6720 
accgaggttg ctacgggcgg gctgctctgc 6780 
tgagttggat gatatggttg gacgctggaa 6840 
cgcgtcacgc acgaaggagg cgtaggagtc 6900 
ctgcacgtct agggcgcagt agtccagggt 6960 
cttttttttc cacagctcgc ggttgaggac 7020 
gatcggaaac ccgtcggcct ccgaacggta 7080 
ctggtaggcg cagcatccct tttctacggg 714 0 
cgaggtgtgg gtgagcgcaa aggtgtccct 7 200 
gtcagtgtcg tcgcatccgc cctgctccca 7260 
cggatttggc agggcgaagg tgacatcgtt 7320 
gttgcgtgtg atgcggaagg gtcccggcac 7380 
gagcacgatc tcgtcaaagc cgttgatgtt 74 40 
cgggatgccc ttgatggaag gcaatttttt 7500 
gctgagcccg tgctctgaaa gggcccagtc 7560 
gctccacagg tcacgggcca ttagcatttg 7620 
acctatggcc attttttctg gggtgatgca 7680 
gtcccatcca aggttcgcgg ctaggtctcg 7740 
gaacttcatg accagcatga agggcacgag 7800 
ggtctctaca tcgtaggtga caaagagacg 7860 
gaactggatc tcccgccacc aattggagga 7920 
cctgcgacgg gccgaacact cgtgctggct 7980 
gtgcacgggc tgtacatcct gcacgaggtt 8040 
tgggaatttg agcccctcgc ctggcgggtt 8100 
tccttgaccg tctggctgct cgaggggagt 8160 
gcccaaagtc cagatgtccg cgcgcggcgg 8220 
ggagctgtcc atggtctgga gctcccgcgg 8280 
tacctcgcat agacgggtca gggcgcgggc 8340 
ctggttggtg gcggcgtcga tggcttgcaa 8400 
accgcgcggc gggcggtggg ccgcgggggt 84 60 
cgcgggcgag cccccggagg tagggggggc 8520 
acgtcggcgc cgcgcgcggg caggagctgg 8580 
acgacgcggc ggttgatctc ctgaatctgg 8640 
agcttgagcc tgaaagagag ttcgacagaa 8700 
cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcgatctctt cctcctggag atctccgcgt 8820 
ttggaaatgc gggccatgag ctgcgagaag 8880 
ctgtagacca cgcccccttc ggcatcgcgg 8940 
tccacgtgcc gggcgaagac ggcgtagttt 9000 
gtggcggtgt gttctgccac gaagaagtac 9060 
atatccccca aggcctcaag gcgctccatg 9120 
aactgggagt tgcgcgccga cacggttaac 9180 
acagtgtcgc gcacctcgcg ctcaaaggct 924 0 
tcttccataa gggcctcccc ttcttcttct 9300 
cggcgacgac ggcgcaccgg gaggcggtcg 9360 
cggcgcatgg tctcggtgac ggcgcggccg 9420 
cccgtcatgt cccggttatg ggttggcggg 94 80 
acgatgcatc tcaacaattg ttgtgtaggt 9540 
gcatcgaccg gatcggaaaa cctctcgaga 9600 
aggctgagca ccgtggcggg cggcagcggg 9660 
ctgctgatga tgtaattaaa gtaggcggtc 9720 
atgtccttgg gtccggcctg ctgaatgcgc 9780 
tgacatcggc gcaggtcttt gtagtagtct 9840 
tctccttcct cttgtcctgc atctcttgca 9900 
cgtaggtggc gccctcttcc tcccatgcgt 9960 
agggctaggt cggcgacaac gcgctcggct 10020 
gactggaagt catccatgtc cacaaagcgg 10080 
cagttggcca taacggacca gttaacggtc 10140 
ctgagacgcg agtaagccct cgagtcaaat 10200 
tggtatccca ccaaaaagtg cggcggcggc 10260 
ggggctccgg gggcgagatc ttccaacata 10320 
atccaggtga tgccggcggc ggtggtggag 10380 
atgttgcgca gcggcaaaaa gtgctccatg 10440 
caatcgttga cgctctagcg tgcaaaagga 10500 
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gagcctgtaa gcgggcactc ttccgtggtc tggtggataa attcgcaagg gtatcatggc 10560 
ggacgaccgg ggttcgagcc ccgtatccgg ccgtccgccg tgatccatgc ggttaccgcc 10620 
cgcgtgtcga acccaggtgt gcgacgtcag acaacggggg agtgctcctt ttggcttcct 10680 
tccaggcgcg gcggctgctg cgctagcttt tttggccact ggccgcgcgc agcgtaagcg 10740 
gttaggctgg aaagcgaaag cattaagtgg ctcgctccct gtagccggag ggttattttc 10800 
caagggttga gtcgcgggac ccccggttcg agtctcggac cggccggact gcggcgaacg 10860 
ggggtttgcc tccccgtcat gcaagacccc gcttgcaaat tcctccggaa acagggacga 10920 
gccccttttt tgcttttccc agatgcatcc ggtgctgcgg cagatgcgcc cccctcctca 10980 
gcagcggcaa gagcaagagc agcggcagac atgcagggca ccctcccctc ctcctaccgc 11040 
gtcaggaggg gcgacatccg cggttgacgc ggcagcagat ggtgattacg aacccccgcg 11100 
gcgccgggcc cggcactacc tggacttgga ggagggcgag ggcctggcgc ggctaggagc 11160 
gccctctcct gagcggtacc caagggtgca gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gccgcggcag aacctgtttc gcgaccgcga gggagaggag cccgaggaga tgcgggatcg 11280 
aaagttccac gcagggcgcg agctgcggca tggcctgaat cgcgagcggt tgctgcgcga 11340 
ggaggacttt gagcccgacg cgcgaaccgg gattagtccc gcgcgcgcac acgtggcggc 11400 
cgccgacctg gtaaccgcat acgagcagac ggtgaaccag gagattaact ttcaaaaaag 11460 
ctttaacaac cacgtgcgta cgcttgtggc gcgcgaggag gtggctatag gactgatgca 11520 
tctgtgggac tttgtaagcg cgctggagca aaacccaaat agcaagccgc tcatggcgca 11580 
gctgttcctt atagtgcagc acagcaggga caacgaggca ttcagggatg cgctgctaaa 11640 
catagtagag cccgagggcc gctggctgct cgatttgata aacatcctgc agagcatagt 11700 
ggtgcaggag cgcagcttga gcctggctga caaggtggcc gccatcaact attccatgct 11760 
tagcctgggc aagttttacg cccgcaagat ataccatacc ccttacgttc ccatagacaa 11820 
ggaggtaaag atcgaggggt tctacatgcg catggcgctg aaggtgctta ccttgagcga 11880 
cgacctgggc gtttatcgca acgagcgcat ccacaaggcc gtgagcgtga gccggcggcg 11940 
cgagctcagc gaccgcgagc tgatgcacag cctgcaaagg gccctggctg gcacgggcag 12000 
cggcgataga gaggccgagt cctactttga cgcgggcgct gacctgcgcc gggccccaag 12060 
ccgacgcgcc ctggaggcag ctggggccgg accrgggctg gcggtggcac ccgcgcgcgc 12120 
tggcaacgtc ggcggcgtgg aggaatatga cgaggacgat gagtacgagc cagaggacgg 12180 
cgagtactaa gcggtgatgt ttctgatcag atgatgcaag acgcaacgga cccggcggtg 12240 
cgggcggcgc tgcagagcca gccgtccggc cttaactcca cggacgactg gcgccaggtc 12300 
atggaccgca tcatgtcgct gactgcgcgc aatcctgacg cgttccggca gcagccgcag 12360 
gccaaccggc tctccgcaat tctggaagcg gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gagaaggrgc tggcgatcgt aaacgcgctg gccgaaaaca gggccatccg gcccgacgag 12480 
gccggcctgg tctacgacgc gctgcttcag cgcgtggctc gttacaacag cggcaacgtg 12540 
cagaccaacc tggaccggct ggtgggggat gtgcgcgagg ccgtggcgca gcgtgagcgc 12600 
gcgcagcagc agggcaacct gggctccatg gttgcactaa acgccttcct gagtacacag 12660 
cccgccaacg tgccgcgggg acaggaggac tacaccaact ttgtgagcgc actgcggcta 12720 
atggtgactg agacaccgca aagtgaggtg taccagtctg ggccagacta ttttttccag 12780 
accagtagac aaggcctgca gaccgtaaac ctgagccagg ctttcaaaaa cttgcagggg 12840 
ctgtgggggg tgcgggctcc cacaggcgac cgcgcgaccg tgtctagctt gctgacgccc 12900 
aactcgcgcc tgttgctgct gctaatagcg cccttcacgg acagtggcag cgtgtcccgg 12960 
gacacatacc taggtcactt gctgacactg taccgcgagg ccataggtca ggcgcatgtg 13020 
gacgagcata ctttccagga gattacaagt gtcagccgcg cgctggggca ggaggacacg 13080 
ggcagcctgg aggcaaccct aaactacctg ctgaccaacc ggcggcagaa gatcccctcg 13140 
ttgcacagtt taaacagcga ggaggagcgc attttgcgct acgtgcagca gagcgtgagc 13200 
cttaacctga tgcgcgacgg ggtaacgccc agcgtggcgc tggacatgac cgcgcgcaac 13260 
atggaaccgg gcatgtatgc ctcaaaccgg ccgtttatca accgcctaat ggactacttg 13320 
catcgcgcgg ccgccgtgaa ccccgagtat ttcaccaatg ccatcttgaa cccgcactgg 13380 
ctaccgcccc ctggtttcta caccggggga ttcgaggtgc ccgagggtaa cgatggattc 13440 
ctctgggacg acatagacga cagcgtgttt tccccgcaac cgcagaccct gctagagttg 13500 
caacagcgcg agcaggcaga ggcggcgctg cgaaaggaaa gcttccgcag gccaagcagc 13560 
ttgtccgatc taggcgctgc ggccccgcgg tcagatgcta gtagcccatt tccaagcttg 13620 
atagggtctc ttaccagcac tcgcaccacc cgcccgcgcc tgctgggcga ggaggagtac 13680 
ctaaacaact cgctgctgca gccgcagcgc gaaaaaaacc tgcctccggc atttcccaac 13740 
aacgggatag agagcctagt ggacaagatg agtagatgga agacgtacgc gcaggagcac 13800 
agggacgtgc caggcccgcg cccgcccacc cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
ctggtgtggg aggacgatga ctcggcagac gacagcagcg tcctggattt gggagggagt 13920 
ggcaacccgt ttgcgcacct tcgccccagg ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atgatgcaaa ataaaaaact caccaaggcc atggcaccga gcgttggttt tcttgtattc 14040 
cccttagtat gcggcgcgcg gcgatgtatg aggaaggtcc tcctccctcc tacgagagtg 14100 
tggtgagcgc ggcgccagtg gcggcggcgc tgggttctcc cttcgatgct cccctggacc 14160 
cgccgtttgt gcctccgcgg tacctgcggc ctaccggggg gagaaacagc atccgttact 14220 
ctgagttggc acccctattc gacaccaccc gtgtgtacct ggtggacaac aagtcaacgg 14280 
atgtggcatc cctgaactac cagaacgacc acagcaactt tctgaccacg gtcattcaaa 14340 
acaatgacta cagcccgggg gaggcaagca cacagaccat caatcttgac gaccggtcgc 14 400 
actggggcgg cgacctgaaa accatcctgc ataccaacat gccaaatgtg aacgagttca 144 60 
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tgtttaccaa 
aggtggagct 
ccatgaccat 
agaacggggt 
ggtttgaccc 
cagacatcat 
tgttgggcat 
tggagggtgg 
atgacaccga 
aagagaactc 
ccattcgcgg 
cggccgaagc 
tgatcaaacc 
gcaccttcac 
gaatccgctc 
actggtcgtt 
gcaactttcc 
accaggccgt 
gctttcccga 
aaaacgttcc 
tccagcgagt 
tgggcatagt 
ttatatcgcc 
gggccaagaa 
ggggcgcgca 
tggtggagga 
ccattcagac 
gcgtagcacg 
tgcttaaccg 
ccgcgggtat 
cggccattag 
ttagcggcct 
actacttaga 
ccaagcgcaa 
cgaagaagga 
aagatgatga 
gacgggtaca 
tctttacgcc 
gcgacgagga 
ggcataagga 
ccgtaacact 
agcgcgagtc 
tggaagatgt 
ggccaatcaa 
ctaccagtag 
ttgcctcagc 
ctacggaggt 
gttcgaggaa 
ttgcgcctac 
gacgccgaac 
cgatttccgt 
gctaccaccc 
cctgccgcct 
tggccggcca 
cgcaccgtcg 
ttggcgccgt 
caagttgcat 
taactatttt 
cgcccgttca 
agctggggct 
agcaaggcct 
ttccaacaaa 
aaccaggcag 
cctccaccgg 
gacagggaag 
aagcaaggcc 



taagtttaag 
gaaatacgag 
agaccttatg 
tctggaaagc 
cgtcactggt 
tttgctgcca 
ccgcaagcgg 
taacattccc 
acagggcggg 
caacgcggca 
cgacaccttt 
tgccgccccc 
cctgacagag 
ccagtaccgc 
atggaccctg 
gccagacatg 
ggtggtgggc 
ctactcccaa 
gaaccagatt 
tgctctcaca 
gaccattact 
ctcgccgcgc 
cagcaataac 
gcgctccgac 
caaacgcggc 
ggcgcgcaac 
cgtggtgcgc 
tcgccaccgc 
cgcacgtcgc 
tgtcactgtg 
tgctatgact 
gcgcgtgccc 
ctcgtactgt 
aatcaaagaa 
agagcaggat 
tgatgaactt 
gtggaaaggt 
cggtgagcgc 
cctgcttgag 
catgctggcg 
gcagcaggtg 
tggtgacttg 
cttggaaaaa 
gcaggtggcg 
caccagtatt 
ggtggcggat 
gcaaacggac 
gtacggcgcc 
ccccggctat 
caccactgga 
gcgcagggtg 
cagcatcgtt 
ccgtttcccg 
cggcctgacc 
catgcgcggc 
gcccggaatt 
gtggaaaaat 
gtagaatgga 
tgggaaactg 
cgctgtggag 
ggaacagcag 
aggtggtaga 
tgcaaaataa 
ccgtggagac 
aaactctggt 
tgcccaccac 



gcgcgggtga 
tgggtggagt 
aacaacgcga 
gacatcgggg 
cttgtcatgc 
ggatgcgggg 
caacccttcc 
gcactgttgg 
ggtggcgcag 
gccgcggcaa 
gccacacggg 
gctgcgcaac 
gacagcaaga 
agctggtacc 
ctttgcactc 
atgcaagacc 
gccgagctgt 
ctcatccgcc 
ttggcgcgcc 
gatcacggga 
gacgccagac 
gtcctatcga 
acaggctggg 
caacacccag 
cgcactgggc 
tacacgccca 
ggagcccggc 
cgccgacccg 
accggccgac 
ccccccaggt 
cagggtcgca 
gtgcgcaccc 
tgtatgtatc 
gagatgctcc 
tacaagcccc 
gacgacgagg 
cgacgcgtaa 
tccacccgca 
caggccaacg 
ttgccgctgg 
ctgcccgcgc 
gcacccaccg 
atgaccgtgg 
ccgggactgg 
gccaccgcca 
gccgcggtgc 
ccgtggatgt 
gccagcgcgc 
cgtggctaca 
acccgccgcc 
gctcgcgaag 
taaaagccgg 
gtgccgggat 
ggcggcatgc 
ggtatcctgc 
gcatccgtgg 
caaaataaaa 
agacatcaac 
gcaagatatc 
cggcattaaa 
cacaggccag 
tggcctggcc 
gattaacagt 
agtgtctcca 
gacgcaaata 
ccgtcccatc 



tggtgtcgcg 
tcacgctgcc 
tcgtggagca 
taaagtttga 
ctggggtata 
tggacttcac 
aggagggctt 
atgtggacgc 
gcggcagcaa 
tgcagccggt 
ctgaggagaa 
ccgaggtcga 
aacgcagtta 
ttgcatacaa 
ctgacgtaac 
ccgtgacctt 
tgcccgtgca 
agtttacctc 
cgccagcccc 
cgctaccgct 
gccgcacctg 
gccgcacttt 
gcctgcgctt 
tgcgcgtgcg 
gcaccaccgt 
cgccgccacc 
gctatgctaa 
gcactgccgc 
gggcggccat 
ccaggcgacg 
ggggcaacgt 
gccccccgcg 
cagcggcggc 
aggtcatcgc 
gaaagctaaa 
tggaactgct 
aacgtgtttt 
cctacaagcg 
agcgcctcgg 
acgagggcaa 
ttgcaccgtc 
tgcagctgat 
aacctgggct 
gcgtgcagac 
cagagggcat 
aggcggtcgc 
ttcgcgtttc 
tactgcccga 
cctaccgccc 
gccgtcgccg 
gaggcaggac 
tctttgtggt 
tccgaggaag 
gtcgtgcgca 
ccctccttat 
ccttgcaggc 
agtctggact 
tttgcgtctc 
ggcaccagca 
aatttcggtt 
atgctgaggg 
tctggcatta 
aagcttgatc 
gaggggcgtg 
gacgagcctc 
gcgcccatgg 



cttgcctact 

cgagggcaac 

ctacttgaaa 

cacccgcaac 

tacaaacgaa 

ccacagccgc 

taggatcacc 

ctaccaggcg 

cagcagtggc 

ggaggacatg 

gcgcgctgag 

gaagcctcag 

caacctaata 

ctacggcgac 

ctgcggctcg 

ccgctccacg 

ctccaagagc 

tctgacccac 

caccatcacc 

gcgcaacagc 

cccctacgtt 

ttgagcaagc 

cccaagcaag 

cgggcactac 

cgatgacgcc 

agtgtccaca 

aatgaagaga 

ccaacgcgcg 

gcgggccgct 

agcggccgcc 

gtattgggtg 

caactagatt 

ggcgcgcaac 

gccggagatc 

gcgggtcaaa 

gcacgctacc 

gcgacccggc 

cgtgtatgat 

ggagtttgcc 

cccaacacct 

cgaagaaaag 

ggtacccaag 

ggagcccgag 

cgtggacgtt 

ggagacacaa 

tgcggccgcg 

agccccccgg 

atatgcccta 

cagaagacga 

tcgccagccc 

cctggtgctg 

tcttgcagat 

aatgcaccgt 

ccaccggcgg 

tccactgatc 

gcagagacac 

ctcacgctcg 

tggccccgcg 

atatgagcgg 

ccaccgttaa 

ataagttgaa 

gcggggtggt 

cccgccctcc 

gcgaaaagcg 

cctcgtacga 

ctaccggagt 



aaggacaatc 

tactccgaga 

gtgggcagac 

ttcagactgg 

gccttccatc 

ctgagcaact 

tacgatgatc 

agcttgaaag 

agcggcgcgg 

aacgatcatg 

gccgaagcag 

aagaaaccgg 

agcaatgaca 

cctcagaccg 

gagcaggtct 

cgccagatca 

ttctacaacg 

gtgttcaatc 

accgtcagtg 

atcggaggag 

tacaaggccc 

atgtccatcc 

atgtttggcg 

cgcgcgccct 

atcgacgcgg 

gtggacgcgg 

cggcggaggc 

gcggcggccc 

cgaaggctgg 

gcagcagccg 

cgcgactcgg 

gcaagaaaaa 

gaagctatgt 

tatggccccc 

aagaaaaaga 

gcgcccaggc 

accaccgtag 

gaggtgtacg 

tacggaaagc 

agcctaaagc 

cgcggcctaa 

cgccagcgac 

gtccgcgtgc 

cagataccca 

acgtccccgg 

tccaagacct 

cgcccgcgcg 

catccttcca 

gcaactaccc 

gtgctggccc 

ccaacagcgc 

atggccctca 

aggaggggca 

cggcgcgcgt 

gccgcggcga 

tgattaaaaa 

cttggtcctg 

acacggctcg 

tggcgccttc 

gaactatggc 

agagcaaaat 

ggacctggcc 

cgtagaggag 

tccgcgcccc 

ggaggcacta 

gctgggccag 



14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 

15960 

16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 

16980 

17040 

17100 

17160 

17220 

17280 

17340 

17400 

17460 

17520 

17580 

17640 

17700 

17760 

17820 

17880 

17940 

18000 

18060 

18120 

18180 

18240 

18300 

18360 

18420 
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cacacacccg taacgctgga cctgcctccc 
ccaggcccga ccgccgttgt tgtaacccgt 
agcggtccgc gatcgttgcg gcccgtagcc 
atcgtgggtc tgggggtgca atccctgaag 
tcgtatgtgt gtcatgtatg cgtccatgtc 
gcccgctttc caagatggct accccttcga 
cgggccagga cgcctcggag tacctgagcc 
agacgtactt cagcctgaat aacaagttta 
tgaccacaga ccggtcccag cgtttgacgc 
ctgcgtactc gtacaaggcg cggttcaccc 
tggcttccac gtactttgac atccgcggcg 
actctggcac tgcctacaac gccctggctc 
atgaagctgc tactgctctt gaaataaacc 
aagtagacga gcaagctgag cagcaaaaaa 
gtataaatat tacaaaggag ggtattcaaa 
ccgataaaac atttcaacct gaacctcaaa 
ttaatcatgc agctgggaga gtccttaaaa 
catatgcaaa acccacaaat gaaaatggag 
gaaagctaga aagtcaagtg gaaatgcaat 
atggtgataa cttgactcct aaagtggtat 
cagacactca tatttcttac atgcccacta 
gccaacaatc tatgcccaac aggcctaatt 
taatgtatta caacagcacg ggtaatatgg 
atgctgttgt agatttgcaa gacagaaaca 
ccattggtga tagaaccagg tacttttcta 
cagatgttag aattattgaa aatcatggaa 
cactgggagg tgtgattaat acagagactc 
aaaatggatg ggaaaaagat gctacagaat 
ataattttgc catggaaatc aatctaaatg 
acatagcgct gtatttgccc gacaagctaa 
ataacccaaa cacctacgac tacatgaaca 
gctacattaa ccttggagca cgctggtccc 
accaccaccg caatgctggc ctgcgctacc 
tgcccttcca catccaggtg cctcagaagt 
cgggctcata cacctacgag tggaacttca 
ccctaggaaa tgacctaagg gttgacggag 
acgccacctt cttccccatg gcccacaaca 
acgacaccaa cgaccagtcc tttaacgact 
tacccgccaa cgctaccaac gtgcccatat 
gcggctgggc cttcacgcgc cttaagacta 
acccttatta cacctactct ggctctatac 
acacctttaa gaaggtggcc attacctttg 
gcctgcttac ccccaacgag tttgaaatta 
ttgcccagtg taacatgacc aaagactggt 
ttggctacca gggcttctat atcccagaga 
gaaacttcca gcccatgagc cgtcaggtgg 
aggtgggcat cctacaccaa cacaacaact 
tgcgcgaagg acaggcctac cctgctaact 
ttgacagcat tacccagaaa aagtttcttt 
ccagtaactt tatgtccatg ggcgcactca 
actccgccca cgcgctagac atgacttttg 
tttatgtttt gtttgaagtc tttgacgtgg 
tcgaaaccgt gtacctgcgc acgcccttct 
agcaacatca acaacagctg ccgccatggg 
caaagatctt ggttgtgggc catatttttt 
tgtttctcca cacaagctcg cctgcgccat 
cgtacactgg atggcctttg cctggaaccc 
ctttggcttt tctgaccagc gactcaagca 
gcgccgtagc gccattgctt cttcccccga 
aagcgtacag gggcccaact cggccgcctg 
ctttgccaac tggccccaaa ctcccatgga 
ggtacccaac tccatgctca acagtcccca 
acagctctac agcttcctgg agcgccactc 
taggagcgcc acttcttttt gtcacttgaa 
tttcaataaa ggcaaatgct tttatttgta 
tgccgtctgc gccgtttaaa aatcaaaggg 



cccgccgaca cccagcagaa acctgtgctg 18480 
cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agtggcaact ggcaaagcac actgaacagc 18600 
cgccgacgat gcttctgaat agctaacgtg 18660 
gccgccagag gagctgctga gccgccgcgc 18720 
tgatgccgca gtggtcttac atgcacatct 18780 
ccgggctggt gcagtttgcc cgcgccaccg 18840 
gaaaccccac ggtggcgcct acgcacgacg 18900 
tgcggttcat ccctgtggac cgtgaggata 18960 
tagctgtggg tgataaccgt gtgctggaca 19020 
tgctggacag gggccctact tttaagccct 19080 
ccaagggtgc cccaaatcct tgcgaatggg 19140 
tagaagaaga ggacgatgac aacgaagacg 19200 
ctcacgtatt tgggcaggcg ccttattctg 19260 
taggtgtcga aggtcaaaca cctaaatatg 19320 
taggagaatc tcagtggtac gaaactgaaa 19380 
agactacccc aatgaaacca tgttacggtt 19440 
ggcaaggcat tcttgtaaag caacaaaatg 19500 
ttttctcaac tactgaggcg accgcaggca 19560 
tgtacagtga agatgtagat atagaaaccc 19620 
ttaaggaagg taactcacga gaactaatgg 19680 
acattgcttt tagggacaat tttattggtc 19740 
gtgttctggc gggccaagca tcgcagttga 19800 
cagagctttc ataccagctt ttgcttgatt 19860 
tgtggaatca ggctgttgac agctatgatc 19920 
ctgaagatga acttccaaat tactgctttc 19980 
ttaccaaggt aaaacctaaa acaggtcagg 20040 
tttcagataa aaatgaaata agagttggaa 20100 
ccaacctgtg gagaaatttc ctgtactcca 20160 
agtacagtcc ttccaacgta aaaatttctg 20220 
agcgagtggt ggctcccggg ttagtggact 20280 
ttgactatat ggacaacgtc aacccattta 20340 
gctcaatgtt gctgggcaat ggtcgctatg 20400 
tctttgccat taaaaacctc cttctcctgc 204 60 
ggaaggatgt taacatggtt ctgcagagct 20520 
ccagcattaa gtttgatagc atttgccttt 20580 
ccgcctccac gcttgaggcc atgcttagaa 20640 
atctctccgc cgccaacatg ctctacccta 20700 
ccatcccctc ccgcaactgg gcggctttcc 20760 
aggaaacccc atcactgggc tcgggctacg 20820 
cctacctaga tggaaccttt tacctcaacc 20880 
actcttctgt cagctggcct ggcaatgacc 20940 
agcgctcagt tgacggggag ggttacaacg 21000 
tcctggtaca aatgctagct aactacaaca 21060 
gctacaagga ccgcatgtac tccttcttta 21120 
tggatgatac taaatacaag gactaccaac 21180 
ctggatttgt tggctacctt gcccccacca 2124 0 
tcccctatcc gcttataggc aagaccgcag 21300 
gcgatcgcac cctttggcgc atcccattct 21360 
cagacctggg ccaaaacctt ctctacgcca 21420 
aggtggatcc catggacgag cccacccttc 21480 
tccgtgtgca ccggccgcac cgcggcgtca 2154 0 
cggccggcaa cgccacaaca taaagaagca 21600 
ctccagtgag caggaactga aagccattgt 21660 
gggcacctat gacaagcgct ttccaggctt 21720 
agtcaatacg gccggtcgcg agactggggg 21780 
gcactcaaaa acatgctacc tctttgagcc 2184 0 
ggtttaccag tttgagtacg agtcactcct 21900 
ccgctgtata acgctggaaa agtccaccca 21960 
tggactattc tgctgcatgt ttctccacgc 22020 
tcacaacccc accatgaacc ttattaccgg 22080 
ggtacagccc accctgcgtc gcaaccagga 22140 
gccctacttc cgcagccaca gtgcgcagat 22200 
aaacatgtaa aaataatgta ctagagacac 22260 
cactctcggg tgattattta cccccaccct 22320 
gttctgccgc gcatcgctat gcgccactgg 22380 
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cagggacacg ttgcgatact ggtgtttagt gctccactta aactcaggca caaccatccg 22440 
cggcagctcg gtgaagtttt cactccacag gctgcgcacc atcaccaacg cgtttagcag 22500 
gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg ccctgcgcgc gcgagttgcg 22560 
atacacaggg ttgcagcact ggaacactat cagcgccggg tggtgcacgc tggccagcac 22620 
gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg ttgctcaggg cgaacggagt 22680 
caactttggt agctgccttc ccaaaaaggg cgcgtgccca ggctttgagt tgcactcgca 22740 
ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg ttaggataca gcgcctgcat 22800 
aaaagccttg atctgcttaa aagccacctg agcctttgcg ccttcagaga agaacatgcc 228 60 
gcaagacttg ccggaaaact gattggccgg acaggccgcg tcgtgcacgc agcaccttgc 22 920 
gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22980 
gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 
gtgctcctta tttatcataa tgcttccgtg tagacactta agctcgcctt cgatctcagc 23100 
gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 
aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 
ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 
cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 
gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 234 00 
cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 234 60 
ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 
cgccacatct tctctttctt cctcgctgtc cacgattacc tctggtgatg gcgggcgctc 23640 
gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 237 00 
ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 23760 
gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 
cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 
gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 
gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24000 
cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 
cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 
cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 24180 
cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 
cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24 300 
cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 24360 
accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24420 
cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 
ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 
gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24600 
gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24 660 
tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 
aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24780 
catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 
aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24900 
ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24 960 
agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 
gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 
caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 25140 
ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 
ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 
gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 
ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 
ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 25440 
gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 
tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 
cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 
ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 257 40 
gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 
gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 
gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 
gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 
aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 26040 
gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 
agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 
aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 26220 
aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 26280 
cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 26340 
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cgccgccggc actgcccgtt cgccgaccca accgtagatg ggacaccact ggaaccaggg 26400 
ccggtaagtc caagcagccg ccgccgttag cccaagagca acaacagcgc caaggctacc 26460 
gctcatggcg cgggcacaag aacgccatag ttgcttgctt gcaagactgt gggggcaaca 26520 
tctccttcgc ccgccgcttt cttctctacc atcacggcgt ggccttcccc cgtaacatcc 26580 
tgcattacta ccgtcatctc tacagcccat actgcaccgg cggcagcggc agcggcagca 26640 
acagcagcgg ccacacagaa gcaaaggcga ccggatagca agactctgac aaagcccaag 26700 
aaatccacag cggcggcagc agcaggagga ggagcgctgc gtctggcgcc caacgaaccc 26760 
gtatcgaccc gcgagcttag aaacaggatt tttcccactc tgtatgctat atttcaacag 26820 
agcaggggcc aagaacaaga gctgaaaata aaaaacaggt ctctgcgatc cctcacccgc 26880 
agctgcctgt atcacaaaag cgaagatcag cttcggcgca cgctggaaga cgcggaggct 26940 
ctcttcagta aatactgcgc gctgactctt aaggactagt ttcgcgccct ttctcaaatt 27000 
taagcgcgaa aactacgtca tctccagcgg ccacacccgg cgccagcacc tgtcgtcagc 27060 
gccattatga gcaaggaaat tcccacgccc tacatgtgga gttaccagcc acaaatggga 27120 
cttgcggctg gagctgccca agactactca acccgaataa actacatgag cgcgggaccc 27180 
cacatgatat cccgggtcaa cggaatccgc gcccaccgaa accgaattct cttggaacag 27240 
gcggctatta ccaccacacc tcgtaataac cttaatcccc gtagttggcc cgctgccctg 27300 
gtgtaccagg aaagtcccgc tcccaccact gtggtacttc ccagagacgc ccaggccgaa 27360 
gttcagatga ctaactcagg ggcgcagctt gcgggcggct ttcgtcacag ggtgcggtcg 27420 
cccgggcagg gtataactca cctgacaatc agagggcgag gtattcagct caacgacgag 274 80 
tcggtgagct cctcgcttgg tctccgtccg gacgggacat ttcagatcgg cggcgccggc 27540 
cgtccttcat tcacgcctcg tcaggcaatc ctaactctgc agacctcgtc ctctgagccg 27 600 
cgctctggag gcattggaac tctgcaattt attgaggagt ttgtgccatc ggtctacttt 27 660 
aaccccttct cgggacctcc cggccactat ccggatcaat ttattcctaa ctttgacgcg 27720 
gtaaaggact cggcggacgg ctacgactga taattaagtg gagaggcaga gcaactgcgc 277 80 
ctgaaacacc tggtccactg tcgccgccac aagtgctttg cccgcgactc cggtgagttt 27 84 0 
tgctactttg aattgcccga ggatcatatc gaggatcttt gttgccatct ctgtgctgag 27 900 
tataataaat acagaaatta aaatatactg gggctcctat cgccatcctg taaacgccac 27960 
cgtcttcacc cgcccaagca aaccaaggcg aaccttacct ggtactttta acatctctcc 28020 
ctctgtgatt tacaacagtt tcaacccaga cggagtgagt ctacgagaga acctctccga 28080 
gctcagctac tccatcagaa aaaacaccac cctccttacc tgccgggaac gtacccttaa 28140 
ttaaaagtca ggcttcctgg atgtcagcat ctgactttgg ccagcacctg tcccgcggat 28200 
ttgttccagt ccaactacag cgacccaccc taacagagat gaccaacaca accaacgcgg 28260 
ccgccgctac cggacttaca tctaccacaa atacacccca agtttctgcc tttgtcaata 28320 
actgggataa cttgggcatg tggtggttct ccatagcgct tatgtttgta tgccttatta 28380 
ttatgtggct catctgctgc ctaaagcgca aacgcgcccg accacccatc tatagtccca 284 40 
tcattgtgct acacccaaac aatgatggaa tccatagatt ggacggactg aaacacatgt 28300 
tcttttctct tacagtatga ttaaatgaga ttaattaagg aatttctgtc cagtttattc 28560 
agcagcacct ccttgccctc ctcccagctc tggtattgca gcttcctcct ggctgcaaac 28620 
tttctccaca atctaaatgg aatgtcagtt tcctcctgtt cctgtccatc cgcacccact 28680 
atcttcatgt tgttgcagat gaagcgcgca agaccgtctg aagatacctt caaccccgtg 28740 
tatccatatg acacggaaac cggtcctcca actgtgcctt ttcttactcc tccctttgta 28800 
tcccccaatg ggtttcaaga gagtccccct ggggtactct ctttgcgcct atccgaacct 28860 
ctagttacct ccaatggcat gcttgcgctc aaaatgggca acggcctctc tctggacgag 28920 
gccggcaacc ttacctccca aaatgtaacc actgtgagcc cacctctcaa aaaaaccaag 28980 
tcaaacataa acctggaaat atctgcaccc ctcacagtta cctcagaagc cctaactgtg 29040 
gctgccgccg cacctctaat ggtcgcgggc aacacactca ccatgcaatc acaggccccg 29100 
ctaaccgtgc acgactccaa acttagcatt gccacccaag gacccctcac agtgtcagaa 29160 
ggaaagctag ccctgcaaac atcaggcccc ctcaccacca ccgatagcag tacccttact 29220 
atcactgcct caccccctct aactactgcc actggtagct tgggcattga cttgaaagag 29280 
cccatttata cacaaaatgg aaaactagga ctaaagtacg gggctccttt gcatgtaaca 29340 
gacgacctaa acactttgac cgtagcaact ggtccaggtg tgactattaa taatacttcc 29400 
ttgcaaacta aagttactgg agccttgggt tttgattcac aaggcaatat gcaacttaat 29460 
gtagcaggag gactaaggat tgattctcaa aacagacgcc ttatacttga tgttagttat 29520 
ccgtttgatg ctcaaaacca actaaatcta agactaggac agggccctct ttttataaac 29580 
tcagcccaca acttggatat taactacaac aaaggccttt acttgtttac agcttcaaac 2 9640 
aattccaaaa agcttgaggt taacctaagc actgccaagg ggttgatgtt tgacgctaca 29700 
gccatagcca ttaatgcagg agatgggctt gaatttggtt cacctaatgc accaaacaca 29760 
aatcccctca aaacaaaaat tggccatggc ctagaatttg attcaaacaa ggctatggtt 29820 
cctaaactag gaactggcct tagttttgac agcacaggtg ccattacagt aggaaacaaa 29880 
aataatgata agctaacttt gtggaccaca ccagctccat ctcctaactg tagactaaat 29940 
gcagagaaag atgctaaact cactttggtc ttaacaaaat gtggcagtca aatacttgct 30000 
acagtttcag ttttggctgt taaaggcagt ttggctccaa tatctggaac agttcaaagt 30060 
gctcatctta ttataagatt tgacgaaaat ggagtgctac taaacaattc cttcctggac 30120 
ccagaatatt ggaactttag aaatggagat cttactgaag gcacagccta tacaaacgct 30180 
gttggattta tgcctaacct atcagcttat ccaaaatctc acggtaaaac tgccaaaagt 30240 
aacattgtca gtcaagttta cttaaacgga gacaaaacta aacctgtaac actaaccatt 30300 
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acactaaacg gtacacagga aacaggagac 
tcatgggact ggtctggcca caactacatt 
ttttcataca ttgcccaaga ataaagaatc 
tcaattgcag aaaatttcaa gtcatttttc 
cttatacaga tcaccgtacc ttaatcaaac 
tccctcccaa cacacagagt acacagtcct 
atcatgggta acagacatat tcttaggtgt 
acgctcatca gtgatattaa taaactcccc 
cagctgctga gccacaggct gctgtccaac 
agtccacgcc tacatggggg tagagtcata 
cagcagcgcg cgaataaact gctgccgccg 
agtggtctcc tcagcgatga ttcgcaccgc 
acagcagcgc accctgatct cacttaaatc 
attgttcaaa atcccacagt gcaaggcgct 
acccacgtgg ccatcatacc acaagcgcag 
gctggacata aacattacct cttttggcat 
aaacctctga ttaaacatgg cgccatccac 
cccgccggct atacactgca gggaaccggg 
ctcgtaacca tggatcatca tgctcgtcat 
gtgcatacac ttcctcagga ttacaagctc 
aacccattcc tgaatcagcg taaatcccac 
gttgtgcatt gtcaaagtgt tacattcggg 
gcgggtttct gtctcaaaag gaggtagacg 
ccgagatcgt gttggtcgta gtgtcatgcc 
tgaagcaaaa ccaggtgcgg gcgtgacaaa 
atcgctctgt gtagtagttg tagtatatcc 
cttcgggttc tatgtaaact ccttcatgcg 
aataagccac acccagccaa cctacacatt 
gaagagctgg aagaaccatg tttttttttt 
tgaagatcta ttaagtgaac gcgctcccct 
gaacagataa tggcatttgt aagatgttgc 
acgtccaagt ggacgtaaag gctaaaccct 
gcaccttcaa ccatgcccaa ataattctca 
aaatcccgaa tattaagtcc ggccattgta 
agcctcaagc agcgaatcat gattgcaaaa 
tcaaaagcgg aacattaaca aaaataccgc 
gaacataatc gtgcaggtct gcacggacca 
caaaagaacc cacactgatt atgacacgca 
cgatgtaagc tttgttgcat . gggcggcgat 
ggcaaagcct cgcgcaaaaa agaaagcaca 
gtaagctccg gaaccaccac agaaaaagac 
ttctgcataa acacaaaata aaataacaaa 
caacaggaaa aacaaccctt ataagcataa 
aaaaaaactg gtcaccgtga ttaaaaagca 
tcataatgta agactcggta aacacatcag 
ccgaaatagc ccgggggaat acatacccgc 
ggaggtataa caaaattaat aggagagaaa 
ctaggcaaaa tagcaccctc ccgctccaga 
aacagtcagc cttaccagta aaaaagaaaa 
ccagctcaat cagtcacagt gtaaaaaagg 
ctcagccatg gcacctctgc agcctgggta 
ggcggggcgt gggggctctg taggagtggc 
gagccctcca ggtgcttgat cccacctttt 
acctgtcctg gctgatggcc tgttcctgcc 
cctggacttg tggcttcctc caaccagggg 
tccaggccgt gggagggagg ttgaggaggg 
tgttcccctc ccatcccctc cctgggcctc 
ttgagagctt cctgcctggc tcttaaccca 
actctcacca cctccttacc ttttcccctg 
gaaaactaca attcccaaca catacaagtt 
cgttcccacg ccccgcgcca cgtcacaaac 
tccaaaataa ggtatattat tgatgatg 



acaactccaa gtgcatactc tatgtcattt 30360 
aatgaaatat ttgccacatc ctcttacact 30420 
gtttgtgtta tgtttcaacg tgtttatttt 30480 
attcagtagt atagccccac caccacatag 30540 
tcacagaacc ctagtattca acctgccacc 30600 
ttctccccgg ctggccttaa aaagcatcat 30660 
tatattccac acggtttcct gtcgagccaa 30720 
gggcagctca cttaagttca tgtcgctgtc 30780 
ttgcggttgc ttaacgggcg gcgaaggaga 30840 
atcgtgcatc aggatagggc ggtggtgctg 30900 
ccgctccgtc ctgcaggaat acaacatggc 30960 
ccgcagcata aggcgccttg tcctccgggc 31020 
agcacagtaa ctgcagcaca gcaccacaat 31080 
gtatccaaag ctcatggcgg ggaccacaga 31140 
gtagattaag tggcgacccc tcataaacac 31200 
gttgtaattc accacctccc ggtaccatat 31260 
caccatccta aaccagctgg ccaaaacctg 31320 
actggaacaa tgacagtgga gagcccagga 31380 
gatatcaatg ttggcacaac acaggcacac 31440 
ctcccgcgtt agaaccatat cccagggaac 31500 
actgcaggga agacctcgca cgtaactcac 31560 
cagcagcgga tgatcctcca gtatggtagc 31620 
atccctactg tacggagtgc gccgagacaa 31680 
aaatggaacg ccggacgtag tcatatttcc 31740 
cagatctgcg tctccggtct cgccgcttag 31800 
actctctcaa agcatccagg cgccccctgg 31860 
ccgctgccct gataacatcc accaccgcag 31920 
cgttctgcga gtcacacacg ggaggagcgg 31980 
tattccaaaa gattatccaa aacctcaaaa 32040 
ccggtggcgt ggtcaaactc tacagccaaa 32100 
acaatggctt ccaaaaggca aacggccctc 32160 
tcagggtgaa tctcctctat aaacattcca 32220 
tctcgccacc ttctcaatat atctctaagc 32280 
aaaatctgct ccagagcgcc ctccaccttc 32340 
attcaggttc ctcacagacc tgtataagat 32400 
gatcccgtag gtcccttcgc agggccagct 32460 
gcgcggccac ttccccgcca ggaaccttga 32520 
tactcggagc tatgctaacc agcgtagccc 32580 
ataaaatgca aggtgctgct caaaaaatca 32640 
tcgtagtcat gctcatgcag ataaaggcag 32700 
accatttttc tctcaaacat gtctgcgggt 32760 
aaaacattta aacattagaa gcctgtctta 32820 
gacggactac ggccatgccg gcgtgaccgt 32880 
ccaccgacag ctcctcggtc atgtccggag 32940 
gttgattcat cggtcagtgc taaaaagcga 33000 
aggcgtagag acaacattac agcccccata 33060 
aacacataaa cacctgaaaa accctcctgc 33120 
acaacataca gcgcttcaca gcggcagcct 33180 
cctattaaaa aaacaccact cgacacggca 33240 
gccaagtgcg ttacactgca gcaggtgtga 33300 
ccctgcttgg ggcatggccc cttatagctg 33360 
agcgacctca gtgtttgtct ttgctctgaa 33420 
cccagcagga acactcctgc ctgccttacc 33480 
tcctttgccc cctgcccaga ctcccatgtt 3354 0 
ctctcaagcc tccatacctg gtcccacctc 33600 
tggagggcat ctggttgggg gcagcctggg 33660 
ccaggccccc tctactcttg agcaatgctc 33720 
gggcaagccc tggaagggca gacccaggac 33780 
gaaaaatctt ctgtatactt cccattttaa 33840 
actccgccct aaaacctacg tcacccgccc 33900 
tccaccccct cattatcata ttggcttcaa 33960 

33988 



<210> 15 
<211> 34737 
<212> DNA 
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<213> Adenovirus subgroup C 
<400> 15 

catcatcaat aatatacctt attttggatt 
ttgtgacgtg gcgcggggcg tgggaacggg 
gatgttgcaa gtgtggcgga acacatgtaa 
gtgtgcgccg gtgtacacag gaagtgacaa 
taaatttggg cgtaaccgag taagatttgg 
agtgaaatct gaataatttt gtgttactca 
gactttgacc gtttacgtgg agactcgccc 
cgggtcaaag ttggcgtttt attattatag 
tgagttcctc aagaggccac tcttgagtgc 
tccgacaccg ggactgaaaa tgagacatga 
ccattttgaa ccacctaccc ttcacgaact 
tcccaacgag gaggcggttt cgcagatttt 
agggattgac ttactcactt ttccgccggc 
ccggcagccc gagcagccgg agcagagagc 
tccacccagt gacgacgagg atgaagaggg 
ccccgggcac ggttgcaggt cttgtcatta 
tatgtgttcg ctttgctata tgaggacctg 
atgggcagtg ggtgatagag tggtgggttt 
gttttgtggt ttaaagaatt ttgtattgtg 
gagcctgagc ccgagccaga accggagcct 
cctgctatcc tgagacgccc gacatcacct 
agctgtgact ccggtccttc taacacacct 
cccattaaac cagttgccgt gagagttggt 
gacttgctta acgagcctgg gcaacctttg 
ggtgtaaacc tgtgattgcg tgtgtggtta 
agtttaataa agggtgagat aatgtttaac 
aaagggtata taatgcgccg tgggctaatc 
gagtgtttgg aagatttttc tgctgtgcgt 
tcttggtttt ggaggtttct gtggggctca 
gaggattaca agtgggaatt tgaagagctt 
ttgaatctgg gtcaccaggc gcttttccaa 
acaccggggc gcgctgcggc tgctgttgct 
gaagaaaccc atctgagcgg ggggtacctg 
gcggttgtga gacacaagaa tcgcctgcta 
ccgacggagg agcagcagca gcagcaggag 
ccatggaacc cgagagccgg cctggaccct 
tgtatccaga actgagacgc attttgacaa 
taaagaggga gcggggggct tgtgaggcta 
taatgaccag acaccgtcct gagtgtatta 
atgagcttga tctgctggcg cagaagtatt 
agccagggga tgattttgag gaggctatta 
attgcaagta caagatcagc aaacttgtaa 
acggggccga ggtggagata gatacggagg 
atatgtggcc gggggtgctt ggcatggacg 
gccccaattt tagcggtacg gttttcctgg 
gcttctatgg gtttaacaat acctgtgtgg 
gtgcctttta ctgctgctgg aagggggtgg 
agaaatgcct ctttgaaagg tgtaccttgg 
gccacaatgt ggcctccgac tgtggttgct 
agcataacat ggtatgtggc aactgcgagg 
acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata ctgacccgct 
tgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 



gaagccaata tgataatgag ggggtggagt 60 
gcgggtgacg tagtagtgtg gcggaagtgt 120 
gcgacggatg tggcaaaagt gacgtttttg 180 
ttttcgcgcg gttttaggcg gatgttgtag 240 
ccattttcgc gggaaaactg aataagagga 300 
tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 420 
tcagctgacg tgtagtgtat ttatacccgg 480 
cagcgagtag agttttctcc tccgagccgc 540 
ggtactggct gataatcttc cacctcctag 600 
gtatgattta gacgtgacgg cccccgaaga 660 
tcccgactct gtaatgttgg cggtgcagga 720 
gcccggttct ccggagccgc ctcacctttc 780 
cttgggtccg gtttgccacg aggctggctt 840 
tgaggagttt gtgttagatt atgtggagca 900 
tcaccggagg aatacggggg acccagatat 960 
tggcatgttt gtctacagta agtgaaaatt 1020 
ggtgtggtaa tttttttttt aatttttaca 1080 
atttttttaa aaggtcctgt gtctgaacct 1140 
gcaagaccta cccgccgtcc taaaatggcg 1200 
gtgtctagag aatgcaatag tagtacggat 1260 
cctgagatac acccggtggt cccgctgtgc 1320 
gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgagct gtaaacgccc caggccataa 14 40 
acgcctttgt ttgctgaatg agttgatgta 1500 
ttgcatggcg tgttaaatgg ggcggggctt 1560 
ttggttacat ctgacctcat ggaggcttgg 1620 
aacttgctgg aacagagctc taacagtacc 1680 
tcccaggcaa agttagtctg cagaattaag 1740 
ttgaaatcct gtggtgagct gtttgattct 1800 
gagaaggtca tcaagacttt ggatttttcc 1860 
tttttgagtt ttataaagga taaatggagc 1920 
ctggattttc tggccatgca tctgtggaga 1980 
ctgttgtctt ccgtccgccc ggcgataata 2040 
gaagccaggc ggcggcggca ggagcagagc 2100 
cgggaatgaa tgttgtacag gtggctgaac 2160 
ttacagagga tgggcagggg ctaaaggggg 2220 
cagaggaggc taggaatcta gcttttagct 2280 
cttttcaaca gatcaaggat aattgcgcta 2340 
ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 24 60 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
gggtggttat tatgaatgta aggtttactg 2640 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 27 60 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 2940 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgca tttgggtaac aggagggggg 3120 
acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 3240 
cctgcgagtg tggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 3540 
ccggggtgcg tcagaatgtg atgggctcca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
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catccgcccg cgatgacaag ttgacggctc ttttggcaca attggattct ttgacccggg 3840 
aacttaatgt cgtttctcag cagctgttgg atctgcgcca gcaggtttct gccctgaagg 3900 
cttcctcccc tcccaatgcg gtttaaaaca taaataaaaa accagactct gtttggattt 3960 
ggatcaagca agtgtcttgc tgtctttatt taggggtttt gcgcgcgcgg taggcccggg 4020 
accagcggtc tcggtcgttg agggtcctgt gtattttttc caggacgtgg taaaggtgac 4080 
tctggatgtt cagatacatg ggcataagcc cgtctctggg gtggaggtag caccactgca 4140 
gagcttcatg ctgcggggtg gtgttgtaga tgatccagtc gtagcaggag cgctgggcgt 4200 
ggtgcctaaa aatgtctttc agtagcaagc tgattgccag gggcaggccc ttggtgtaag 4260 
tgtttacaaa gcggttaagc tgggatgggt gcatacgtgg ggatatgaga tgcatcttgg 4320 
actgtatttt taggttggct atgttcccag ccatatccct ccggggattc atgttgtgca 4380 
gaaccaccag cacagtgtat ccggtgcact tgggaaattt gtcatgtagc ttagaaggaa 4440 
atgcgtggaa gaacttggag acgcccttgt gacctccaag attttccatg cattcgtcca 4500 
taatgatggc aatgggccca cgggcggcgg cctgggcgaa gatatttctg ggatcactaa 4 560 
cgtcatagtt gtgttccagg atgagatcgt cataggccat ttttacaaag cgcgggcgga 4 620 
gggtgccaga ctgcggtata atggttccat ccggcccagg ggcgtagtta ccctcacaga 4 680 
tttgcatttc ccacgctttg agttcagatg gggggatcat gtctacctgc ggggcgatga 4740 
agaaaacggt ttccggggta ggggagatca gctgggaaga aagcaggttc ctgagcagct 4800 
gcgacttacc gcagccggtg ggcccgtaaa tcacacctat taccgggtgc aactggtagt 4860 
taagagagct gcagctgccg tcatccctga gcaggggggc cacttcgtta agcatgtccc 4920 
tgactcgcat gttttccctg accaaatccg ccagaaggcg ctcgccgccc agcgatagca 4980 
gttcttgcaa ggaagcaaag tttttcaacg gtttgagacc gtccgccgta ggcatgcttt 504 0 
tgagcgtttg accaagcagt tccaggcggt cccacagctc ggtcacctgc tctacggcat 5100 
ctcgatccag catatctcct cgtttcgcgg gttggggcgg ctttcgctgt acggcagtag 5160 
tcggtgctcg tccagacggg ccagggtcat gtctttccac gggcgcaggg tcctcgtcag 5220 
cgtagtctgg gtcacggtga aggggtgcgc tccgggctgc gcgctggcca gggtgcgctt 5280 
gaggctggtc ctgctggtgc tgaagcgctg ccggtcttcg ccctgcgcgt cggccaggta 534 0 
gcatttgacc atggtgtcat agtccagccc ctccgcggcg tggcccttgg cgcgcagctt 5400 
gcccttggag gaggcgccgc acgaggggca gtgcagactt ttgagggcgt agagcttggg 5460 
cgcgagaaat accgattccg gggagtaggc atccgcgccg caggccccgc agacggtctc 5520 
gcattccacg agccaggtga gctctggccg ttcggggtca aaaaccaggt ttcccccatg 5580 
ctttttgatg cgtttcttac ctctggtttc catgagccgg tgtccacgct cggtgacgaa 5640 
aaggctgtcc gtgtccccgt atacagactt gagaggcctg tcctcgagcg gtgttccgcg 5700 
gtcctcctcg tatagaaact cggaccactc tgagacaaag gctcgcgtcc aggccagcac 5760 
gaaggaggct aagtgggagg ggtagcggtc gttgtccact agggggtcca ctcgctccag 5820 
ggtgtgaaga cacatgtcgc cctcttcggc atcaaggaag gtgattggtt tgtaggtgta 5880 
ggccacgtga ccgggtgttc ctgaaggggg gctataaaag ggggtggggg cgcgttcgtc 594 0 
ctcactctct tccgcatcgc tgtctgcgag ggccagctgt tggggtgagt actccctctg 6000 
aaaagcgggc atgacttctg cgctaagatt gtcagtttcc aaaaacgagg aggatttgat 6060 
attcacctgg cccgcggtga tgcctttgag ggtggccgca tccatctggt cagaaaagac 6120 
aatctttttg ttgtcaagct tggtggcaaa cgacccgtag agggcgttgg acagcaactt 6180 
ggcgatggag cgcagggttt ggtttttgtc gcgatcggcg cgctccttgg ccgcgatgtt 6240 
tagctgcacg tattcgcgcg caacgcaccg ccattcggga aagacggtgg tgcgctcgtc 6300 
gggcaccagg tgcacgcgcc aaccgcggtt gtgcagggtg acaaggtcaa cgctggtggc 6360 
tacctctccg cgtaggcgct cgttggtcca gcagaggcgg ccgcccttgc gcgagcagaa 6420 
tggcggtagg gggtctagct gcgtctcgtc cggggggtct gcgtccacgg taaagacccc 6480 
gggcagcagg cgcgcgtcga agtagtctat cttgcatcct tgcaagtcta gcgcctgctg 6540 
ccatgcgcgg gcggcaagcg cgcgctcgta tgggttgagt gggggacccc atggcatggg 6600 
gtgggtgagc gcggaggcgt acatgccgca aatgtcgtaa acgtagaggg gctctctgag 6660 
tattccaaga tatgtagggt agcatcttcc accgcggatg ctggcgcgca cgtaatcgta 6720 
tagttcgtgc gagggagcga ggaggtcggg accgaggttg ctacgggcgg gctgctctgc 6780 
tcggaagact atctgcctga agatggcatg tgagttggat gatatggttg gacgctggaa 684 0 
gacgttgaag ctggcgtctg tgagacctac cgcgtcacgc acgaaggagg cgtaggagtc 6900 
gcgcagcttg ttgaccagct cggcggtgac ctgcacgtct agggcgcagt agtccagggt 6960 
ttccttgatg atgtcatact tatcctgtcc cttttttttc cacagctcgc ggttgaggac 7020 
aaactcttcg cggtctttcc agtactcttg gatcggaaac ccgtcggcct ccgaacggta 7080 
agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 7140 
tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 72O0 
gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 7260 
gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 7320 
gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 7380 
ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 7440 
gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc ttgatggaag gcaatttttt 7500 
aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 7560 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 7620 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 7680 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 7740 
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cgcggcagtc actagaggct catctccgcc 
ctgcttccca aaggccccca tccaagtata 
ctcggtgcga ggatgcgagc cgatcgggaa 
gtggctattg atgtggtgaa agtagaagtc 
tttgtaaaaa cgtgcgcagt actggcagcg 
gacctgacga ccgcgcacaa ggaagcagag 
tggctggtgg tcttctactt cggctgcttg 
tacggtggat cggaccacca cgccgcgcga 
tcggagcttg atgacaacat cgcgcagatg 
cgtcaggtca ggcgggagct cctgcaggtt 
tagatccagg tgatacctaa tttccagggg 
gaggccgcat ccccgcggcg cgactacggt 
gtccttggat gatgcatcta aaagcggtga 
tccggacccg ccgggagagg gggcaggggc 
tgctgcgcgc gtaggttgct ggcgaacgcg 
cgcctctgcg tgaagacgac gggcccggtg 
tcaatttcgg tgtcgttgac ggcggcctgg 
tcttgatagg cgatctcggc catgaactgc 
ccggctcgct ccacggtggc ggcgaggtcg 
gcgttgaggc ctccctcgtt ccagacgcgg 
gcgcgcatga ccacctgcgc gagattgagc 
cgcaggcgct gaaagaggta gttgagggtg 
ataacccagc gtcgcaacgt ggattcgttg 
gcctcgtaga agtccacggc gaagttgaaa 
tcctcctcca gaagacggat gagctcggcg 
acaggggcct cttcttcttc ttcaatctcc 
tctggcggcg gtgggggagg ggggacacgg 
acaaagcgct cgatcatctc cccgcggcga 
ttctcgcggg ggcgcagttg gaagacgccg 
gggctgccat gcggcaggga tacggcgcta 
actccgccgc cgagggacct gagcgagtcc 
aaggcgtcta accagtcaca gtcgcaaggt 
cggcggtcgg ggttgtttct ggcggaggtg 
ttgagacggc ggatggtcga cagaagcacc 
aggcggtcgg ccatgcccca ggcttcgttt 
tgcatgagcc tttctaccgg cacttcttct 
tctatcgctg cggcggcggc ggagtttggc 
gtgaccccga agcccctcat cggctgaagc 
aatatggcct gctgcacctg cgtgagggta 
tggtatgcgc ccgtgttgat ggtgtaagtg 
tggtgacccg gctgcgagag ctcggtgtac 
acgtagtcgt tgcaagtccg caccaggtac 
tggcggtaga ggggccagcg tagggtggcc 
aggcgatgat atccgtagat gtacctggac 
gcgcgcggaa agtcgcggac gcggttccag 
gtcgggacgc tctggccggt caggcgcgcg 
gagcctgtaa gcgggcactc ttccgtggtc 
ggacgaccgg ggttcgagcc ccgtatccgg 
cgcgtgtcga acccaggtgt gcgacgtcag 
tccaggcgcg gcggctgctg cgctagcttt 
gttaggctgg aaagcgaaag cattaagtgg 
caagggttga gtcgcgggac ccccggttcg 
ggggtttgcc tccccgtcat gcaagacccc 
gccccttttt tgcttttccc agatgcatcc 
gcagcggcaa gagcaagagc agcggcagac 
gtcaggaggg gcgacatccg cggttgacgc 
gcgccgggcc cggcactacc tggacttgga 
gccctctcct gagcggtacc caagggtgca 
gccgcggcag aacctgtttc gcgaccgcga 
aaagttccac gcagggcgcg agctgcggca 
ggaggacttt gagcccgacg cgcgaaccgg 
cgccgacctg gtaaccgcat acgagcagac 
ctttaacaac cacgtgcgta cgcttgtggc 
tctgtgggac tttgtaagcg cgctggagca 
gctgttcctt atagtgcagc acagcaggga 
catagtagag cccgagggcc gctggctgct 



gaacttcatg accagcatga agggcacgag 7800 
ggtctctaca tcgtaggtga caaagagacg 7860 
gaactggatc tcccgccacc aattggagga 7920 
cctgcgacgg gccgaacact cgtgctggct 7980 
gtgcacgggc tgtacatcct gcacgaggtt 8040 
tgggaatttg agcccctcgc ctggcgggtt 8100 
tccttgaccg tctggctgct cgaggggagt 8160 
gcccaaagtc cagatgtccg cgcgcggcgg 8220 
ggagctgtcc atggtctgga gctcccgcgg 8280 
tacctcgcat agacgggtca gggcgcgggc 8340 
ctggttggtg gcggcgtcga tggcttgcaa 8400 
accgcgcggc gggcggtggg ccgcgggggt 84 60 
cgcgggcgag cccccggagg tagggggggc 8520 
acgtcggcgc cgcgcgcggg caggagctgg 8580 
acgacgcggc ggttgatctc ctgaatctgg 8640 
agcttgagcc tgaaagagag ttcgacagaa 8700 
cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcgatctctt cctcctggag atctccgcgt 8820 
ttggaaatgc gggccatgag ctgcgagaag 8880 
ctgtagacca cgcccccttc ggcatcgcgg 8940 
tccacgtgcc gggcgaagac ggcgtagttt 9000 
gtggcggtgt gttctgccac gaagaagtac 9060 
atatccccca aggcctcaag gcgctccatg 9120 
aactgggagt tgcgcgccga cacggttaac 9180 
acagtgtcgc gcacctcgcg ctcaaaggct 9240 
tcttccataa gggcctcccc ttcttcttct 9300 
cggcgacgac ggcgcaccgg gaggcggtcg 9360 
cggcgcatgg tctcggtgac ggcgcggccg 9420 
cccgtcatgt cccggttatg ggttggcggg 94 80 
acgatgcatc tcaacaattg ttgtgtaggt 9540 
gcatcgaccg gatcggaaaa cctctcgaga 9600 
aggctgagca ccgtggcggg cggcagcggg 9660 
ctgctgatga tgtaattaaa gtaggcggtc 9720 
atgtccttgg gtccggcctg ctgaatgcgc 9780 
tgacatcggc gcaggtcttt gtagtagtct 9840 
tctccttcct cttgtcctgc atctcttgca 9900 
cgtaggtggc gccctcttcc tcccatgcgt 9960 
agggctaggt cggcgacaac gcgctcggct 10020 
gactggaagt catccatgtc cacaaagcgg 10060 
cagttggcca taacggacca gttaacggtc 10140 
ctgagacgcg agtaagccct cgagtcaaat 10200 
tggtatccca ccaaaaagtg cggcggcggc 10260 
ggggctccgg gggcgagatc ttccaacata 10320 
atccaggtga tgccggcggc ggtggtggag 10380 
atgttgcgca gcggcaaaaa gtgctccatg 10440 
caatcgttga cgctctagcg tgcaaaagga 10500 
tggtggataa attcgcaagg gtatcatggc 10560 
ccgtccgccg tgatccatgc ggttaccgcc 10620 
acaacggggg agtgctcctt ttggcttcct 10680 
tttggccact ggccgcgcgc agcgtaagcg 10740 
ctcgctccct gtagccggag ggttattttc 10800 
agtctcggac cggccggact gcggcgaacg 10860 
gcttgcaaat tcctccggaa acagggacga 10920 
ggtgctgcgg cagatgcgcc cccctcctca 10980 
atgcagggca ccctcccctc ctcctaccgc 11040 
ggcagcagat ggtgattacg aacccccgcg 11100 
ggagggcgag ggcctggcgc ggctaggagc 11160 
gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gggagaggag cccgaggaga tgcgggatcg 11280 
tggcctgaat cgcgagcggt tgctgcgcga 11340 
gattagtccc gcgcgcgcac acgtggcggc 11400 
ggtgaaccag gagattaact ttcaaaaaag 114 60 
gcgcgaggag gtggctatag gactgatgca 11520 
aaacccaaat agcaagccgc tcatggcgca 11580 
caacgaggca ttcagggatg cgctgctaaa 11640 
cgatttgata aacatcctgc agagcatagt 11700 
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ggtgcaggag cgcagcttga gcctggctga 
tagcctgggc aagttttacg cccgcaagat 
ggaggtaaag atcgaggggt tctacatgcg 
cgacctgggc gtttatcgca acgagcgcat 
cgagctcagc gaccgcgagc tgatgcacag 
cggcgataga gaggccgagt cctactttga 
ccgacgcgcc ctggaggcag ctggggccgg 
tggcaacgtc ggcggcgtgg aggaatatga 
cgagtactaa gcggtgatgt ttctgatcag 
cgggcggcgc tgcagagcca gccgtccggc 
atggaccgca tcatgtcgct gactgcgcgc 
gccaaccggc tcrccgcaat tctggaagcg 
gagaaggtgc tggcgatcgt aaacgcgctg 
gccggcctgg tctacgacgc gctgcttcag 
cagaccaacc tggaccggct ggtgggggat 
gcgcagcagc agggcaacct gggctccatg 
cccgccaacg tgccgcgggg acaggaggac 
atggtgactg agacaccgca aagtgaggtg 
accagtagac aaggcctgca gaccgtaaac 
ctgtgggggg tgcgggctcc cacaggcgac 
aactcgcgcc tgttgctgct gctaatagcg 
gacacatacc taggtcactt gctgacactg 
gacgagcata ctttccagga gattacaagt 
ggcagcctgg aggcaaccct aaactacctg 
ttgcacagtt taaacagcga ggaggagcgc 
ctraacctga tgcgcgacgg ggtaacgccc 
atggaaccgg gcatgtatgc ctcaaaccgg 
catcgcgcgg ccgccgtgaa ccccgagtat 
ctaccgcccc ctggtttcta caccggggga 
ctctgggacg acatagacga cagcgtgttt 
caacagcgcg agcaggcaga ggcggcgctg 
ttgtccgatc taggcgctgc ggccccgcgg 
atagggtctc ttaccagcac tcgcaccacc 
ctaaacaact cgctgctgca gccgcagcgc 
aacgggatag agagcctagt ggacaagatg 
agggacgtgc caggcccgcg cccgcccacc 
ctggtgtggg aggacgatga ctcggcagac 
ggcaacccgt ttgcgcacct tcgccccagg 
atgatgcaaa ataaaaaact caccaaggcc 
cccttagtat gcggcgcgcg gcgatgtatg 
tggtgagcgc ggcgccagtg gcggcggcgc 
cgccgtttgt gcctccgcgg tacctgcggc 
ctgagttggc acccctattc gacaccaccc 
atgtggcatc cctgaactac cagaacgacc 
acaatgacta cagcccgggg gaggcaagca 
actggggcgg cgacctgaaa accatcctgc 
tgtttaccaa taagtttaag gcgcgggtga 
aggtggagct gaaatacgag tgggtggagt 
ccatgaccat agaccttatg aacaacgcga 
agaacggggt tctggaaagc gacatcgggg 
ggtttgaccc cgtcactggt cttgtcatgc 
cagacatcat tttgctgcca ggatgcgggg 
tgttgggcat ccgcaagcgg caacccttcc 
tggagggtgg taacattccc gcactgttgg 
atgacaccga acagggcggg ggtggcgcag 
aagagaactc caacgcggca gccgcggcaa 
ccattcgcgg cgacaccttt gccacacggg 
cggccgaagc tgccgccccc gctgcgcaac 
tgatcaaacc cctgacagag gacagcaaga 
gcaccttcac ccagtaccgc agctggtacc 
gaatccgctc atggaccctg ctttgcactc 
actggtcgtt gccagacatg atgcaagacc 
gcaactttcc ggtggtgggc gccgagctgt 
accaggccgt ctactcccaa ctcatccgcc 
gctttcccga gaaccagatt tcggcgcgcc 
aaaacgttcc tgctctcaca gatcacggga 



caaggtggcc gccatcaact attccatgct 117 60 
ataccatacc ccttacgttc ccatagacaa 11820 
catggcgctg aaggtgctta ccttgagcga 11880 
ccacaaggcc gtgagcgtga gccggcggcg 11940 
cctgcaaagg gccctggctg gcacgggcag 12000 
cgcgggcgct gacctgcgct gggccccaag 12060 
acctgggctg gcggtggcac ccgcgcgcgc 12120 
cgaggacgat gagtacgagc cagaggacgg 12180 
atgatgcaag acgcaacgga cccggcggtg 12240 
cttaactcca cggacgactg gcgccaggtc 12300 
aatcctgacg cgttccggca gcagccgcag 12360 
gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gccgaaaaca gggccatccg gcccgacgag 12480 
cgcgtggctc gttacaacag cggcaacgtg 12540 
gtgcgcgagg ccgtggcgca gcgtgagcgc 12600 
gttgcactaa acgccttcct gagtacacag 12660 
tacaccaact ttgtgagcgc actgcggcta 12720 
taccagtctg ggccagacta ttttttccag 12780 
ctgagccagg ctttcaaaaa cttgcagggg 12840 
cgcgcgaccg tgtctagctt gctgacgccc 12900 
cccttcacgg acagtggcag cgtgtcccgg 12960 
taccgcgagg ccataggtca ggcgcatgtg 13020 
gtcagccgcg cgctggggca ggaggacacg 13080 
ctgaccaacc ggcggcagaa gatcccctcg 13140 
attttgcgct acgtgcagca gagcgtgagc 13200 
agcgtggcgc tggacatgac cgcgcgcaac 13260 
ccgtttatca accgcctaat ggactacttg 13320 
ttcaccaatg ccatcttgaa cccgcactgg 13380 
ttcgaggtgc ccgagggtaa cgatggattc 134 4 0 
tccccgcaac cgcagaccct gctagagttg 13500 
cgaaaggaaa gcttccgcag gccaagcagc 13560 
tcagatgcta gtagcccatt tccaagcttg 13620 
cgcccgcgcc tgctgggcga ggaggagtac 13680 
gaaaaaaacc tgcctccggc atttcccaac 13740 
agtagatgga agacgtacgc gcaggagcac 13800 
cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
gacagcagcg tcctggattt gggagggagt 13920 
ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atggcaccga gcgttggttt tcttgtattc 14040 
aggaaggtcc tcctccctcc tacgagagtg 14100 
tgggttctcc cttcgatgct cccctggacc 14160 
ctaccggggg gagaaacagc atccgttact 14220 
gtgtgtacct ggtggacaac aagtcaacgg 14280 
acagcaactt tctgaccacg gtcattcaaa 14340 
cacagaccat caatcttgac gaccggtcgc 14400 
ataccaacat gccaaatgtg aacgagttca 14460 
tggtgtcgcg cttgcctact aaggacaatc 14520 
tcacgctgcc cgagggcaac tactccgaga 14580 
tcgtggagca ctacttgaaa gtgggcagac 14 640 
taaagtttga cacccgcaac ttcagactgg 14700 
ctggggtata tacaaacgaa gccttccatc 14760 
tggacttcac ccacagccgc ctgagcaact 14820 
aggagggctt taggatcacc tacgatgatc 14880 
atgtggacgc ctaccaggcg agcttgaaag 14940 
gcggcagcaa cagcagtggc agcggcgcgg 15000 
tgcagccggt ggaggacatg aacgatcatg 15060 
ctgaggagaa gcgcgctgag gccgaagcag 15120 
ccgaggtcga gaagcctcag aagaaaccgg 15180 
aacgcagtta caacctaata agcaatgaca 15240 
ttgcatacaa ctacggcgac cctcagaccg 15300 
ctgacgtaac ctgcggctcg gagcaggtct 15360 
ccgtgacctt ccgctccacg cgccagatca 15420 
tgcccgtgca ctccaagagc ttctacaacg 15480 
agtttacctc tctgacccac gtgttcaatc 15540 
cgccagcccc caccatcacc accgtcagtg 15600 
cgctaccgct gcgcaacagc atcggaggag 15660 
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tccagcgagt gaccattact gacgccagac gccgcacctg cccctacgtt tacaaggccc 15720 
tgggcatagt ctcgccgcgc gtcctatcga gccgcacttt ttgagcaagc atgtccatcc 157B0 
ttatatcgcc cagcaataac acaggctggg gcctgcgctt cccaagcaag atgtttggcg 15840 
gggccaagaa gcgctccgac caacacccag tgcgcgtgcg cgggcactac cgcgcgccct 15900 
ggggcgcgca caaacgcggc cgcactgggc gcaccaccgt cgatgacgcc atcgacgcgg 15960 
tggtggagga ggcgcgcaac tacacgccca cgccgccacc agtgtccaca gtggacgcgg 16020 
ccattcagac cgtggtgcgc ggagcccggc gctatgctaa aatgaagaga cggcggaggc 16080 
gcgtagcacg tcgccaccgc cgccgacccg gcactgccgc ccaacgcgcg gcggcggccc 16140 
tgcttaaccg cgcacgtcgc accggccgac gggcggccat gcgggccgct cgaaggctgg 16200 
ccgcgggtat tgtcactgtg ccccccaggt ccaggcgacg agcggccgcc gcagcagccg 16260 
cggccattag tgctatgact cagggtcgca ggggcaacgt gtattgggtg cgcgactcgg 16320 
ttagcggcct gcgcgtgccc gtgcgcaccc gccccccgcg caactagatt gcaagaaaaa 16380 
actacttaga ctcgtactgt tgtatgtatc cagcggcggc ggcgcgcaac gaagctatgt 16440 
ccaagcgcaa aatcaaagaa gagatgctcc aggtcatcgc gccggagatc tatggccccc 16500 
cgaagaagga agagcaggat tacaagcccc gaaagctaaa gcgggtcaaa aagaaaaaga 16560 
aagatgatga tgatgaactt gacgacgagg tggaactgct gcacgctacc gcgcccaggc 16620 
gacgggtaca gtggaaaggt cgacgcgtaa aacgtgtttt gcgacccggc accaccgtag 16680 
tctttacgcc cggtgagcgc tccacccgca cctacaagcg cgtgtatgat gaggtgtacg 16740 
gcgacgagga cctgcttgag caggccaacg agcgcctcgg ggagtttgcc tacggaaagc 16800 
ggcataagga catgctggcg ttgccgctgg acgagggcaa cccaacacct agcctaaagc 16860 
ccgtaacact gcagcaggtg ctgcccgcgc ttgcaccgtc cgaagaaaag cgcggcctaa 16920 
agcgcgagtc tggtgacttg gcacccaccg tgcagctgat ggtacccaag cgccagcgac 16980 
tggaagatgt cttggaaaaa atgaccgtgg aacctgggct ggagcccgag gtccgcgtgc 17040 
ggccaatcaa gcaggtggcg ccgggactgg gcgtgcagac cgtggacgtt cagataccca 17100 
ctaccagtag caccagtatt gccaccgcca cagagggcat ggagacacaa acgtccccgg 17160 
ttgcctcagc ggtggcggat gccgcggtgc aggcggtcgc tgcggccgcg tccaagacct 17220 
ctacggaggt gcaaacggac ccgtggatgt ttcgcgtttc agccccccgg cgcccgcgcg 17280 
gttcgaggaa gtacggcgcc gccagcgcgc tactgcccga atatgcccta catccttcca 17340 
ttgcgcctac ccccggctat cgtggctaca cctaccgccc cagaagacga gcaactaccc 17400 
gacgccgaac caccactgga acccgccgcc gccgtcgccg tcgccagccc gtgctggccc 17460 
cgatttccgt gcgcagggtg gctcgcgaag gaggcaggac cctggtgctg ccaacagcgc 17520 
gctaccaccc cagcatcgtt taaaagccgg tctttgtggt tcttgcagat atggccctca 17580 
cctgccgcct ccgtttcccg gtgccgggat tccgaggaag aatgcaccgt aggaggggca 17640 
tggccggcca cggcctgacg ggcggcatgc gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 
cgcaccgtcg catgcgcggc ggtatcctgc ccctccttat tccactgatc gccgcggcga 17760 
ttggcgccgt gcccggaatt gcatccgtgg ccttgcaggc gcagagacac tgattaaaaa 17820 
caagttgcat gtggaaaaat caaaataaaa agtctggact ctcacgctcg cttggtcctg 17880 
taactatttt gtagaatgga agacatcaac tttgcgtctc tggccccgcg acacggctcg 1794 0 
cgcccgttca tgggaaactg gcaagatatc ggcaccagca atatgagcgg tggcgccttc 18000 
agctggggct cgctgtggag cggcattaaa aatttcggtt ccaccgttaa gaactatggc 18060 
agcaaggcct ggaacagcag cacaggccag atgctgaggg ataagttgaa agagcaaaat 18120 
ttccaacaaa aggtggtaga tggcctggcc tctggcatta gcggggtggt ggacctggcc 18180 
aaccaggcag tgcaaaataa gattaacagt aagcttgatc cccgccctcc cgtagaggag 18240 
cctccaccgg ccgtggagac agtgtctcca gaggggcgtg gcgaaaagcg tccgcgcccc 18300 
gacagggaag aaactctggt gacgcaaata gacgagcctc cctcgtacga ggaggcacta 18360 
aagcaaggcc tgcccaccac ccgtcccatc gcgcccatgg ctaccggagt gctgggccag 18420 
cacacacccg taacgctgga cctgcctccc cccgccgaca cccagcagaa acctgtgctg 184 80 
ccaggcccga ccgccgttgt tgtaacccgt cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agcggtccgc gatcgttgcg gcccgtagcc agtggcaact ggcaaagcac actgaacagc 18600 
atcgtgggtc tgggggtgca atccctgaag cgccgacgat gcttctgaat agctaacgtg 18660 
tcgtatgtgt gtcatgtatg cgtccatgtc gccgccagag gagctgctga gccgccgcgc 18720 
gcccgctttc caagatggct accccttcga tgatgccgca gtggtcttac atgcacatct 18780 
cgggccagga cgcctcggag tacctgagcc ccgggctggt gcagtttgcc cgcgccaccg 18840 
agacgtactt cagcctgaat aacaagttta gaaaccccac ggtggcgcct acgcacgacg 18900 
tgaccacaga ccggtcccag cgtttgacgc tgcggttcat ccctgtggac cgtgaggata 18960 
ctgcgtactc gtacaaggcg cggttcaccc tagctgtggg tgataaccgt gtgctggaca 19020 
tggcttccac gtactttgac atccgcggcg tgctggacag gggccctact tttaagccct 19080 
actctggcac tgcctacaac gccctggctc ccaagggtgc cccaaatcct tgcgaatggg 19140 
atgaagctgc tactgctctt gaaataaacc tagaagaaga ggacgatgac aacgaagacg 19200 
aagtagacga gcaagctgag cagcaaaaaa ctcacgtatt tgggcaggcg ccttattctg 19260 
gtataaatat tacaaaggag ggtattcaaa taggtgtcga aggtcaaaca cctaaatatg 19320 
ccgataaaac atttcaacct gaacctcaaa taggagaatc tcagtggtac gaaactgaaa 19380 
ttaatcatgc agctgggaga gtccttaaaa agactacccc aatgaaacca tgttacggtt 19440 
catatgcaaa acccacaaat gaaaatggag ggcaaggcat tcttgtaaag caacaaaatg 19500 
gaaagctaga aagtcaagtg gaaatgcaat ttttctcaac tactgaggcg accgcaggca 19560 
atggtgataa cttgactcct aaagtggtat tgtacagtga agatgtagat atagaaaccc 19620 
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cagacactca tatttcttac atgcccacta 
gccaacaatc tatgcccaac aggcctaatt 
taatgtatta caacagcacg ggtaatatgg 
atgctgttgt agatttgcaa gacagaaaca 
ccattggtga tagaaccagg tacttttcta 
cagatgttag aattattgaa aatcatggaa 
cactgggagg tgtgattaat acagagactc 
aaaatggatg ggaaaaagat gctacagaat 
ataattttgc catggaaatc aatctaaatg 
acatagcgct gtatttgccc gacaagctaa 
ataacccaaa cacctacgac tacatgaaca 
gctacattaa ccttggagca cgctggtccc 
accaccaccg caatgctggc ctgcgctacc 
tgcccttcca catccaggtg cctcagaagt 
cgggctcata cacctacgag tggaacttca 
ccctaggaaa tgacctaagg gttgacggag 
acgccacctt cttccccatg gcccacaaca 
acgacaccaa cgaccagtcc tttaacgact 
tacccgccaa cgctaccaac gtgcccatat 
gcggctgggc cttcacgcgc cttaagacta 
acccttatta cacctactct ggctctatac 
acacctttaa gaaggtggcc attacctttg 
gcctgcttac ccccaacgag tttgaaatta 
ttgcccagtg taacatgacc aaagactggt 
ttggctacca gggcttctat atcccagaga 
gaaacttcca gcccatgagc cgtcaggtgg 
aggtgggcat cctacaccaa cacaacaact 
tgcgcgaagg acaggcctac cctgctaact 
ttgacagcat tacccagaaa aagtttcttt 
ccagtaactt tatgtccatg ggcgcactca 
actccgccca cgcgctagac atgacttttg 
tttatgtttt gtttgaagtc tttgacgtgg 
tcgaaaccgt gtacctgcgc acgcccttct 
agcaacatca acaacagctg ccgccatggg 
caaagatctt ggttgtgggc catatttttt 
tgtttctcca cacaagctcg cctgcgccat 
cgtacactgg atggcctttg cctggaaccc 
ctttggcttt tctgaccagc gactcaagca 
gcgccgtagc gccattgctt cttcccccga 
aagcgtacag gggcccaact cggccgcctg 
ctttgccaac tggccccaaa ctcccatgga 
ggtacccaac tccatgctca acagtcccca 
acagctctac agcttcctgg agcgccactc 
taggagcgcc acttcttttt gtcacttgaa 
tttcaataaa ggcaaatgct tttatttgta 
tgccgtctgc gccgtttaaa aatcaaaggg 
cagggacacg ttgcgatact ggtgtttagt 
cggcagctcg gtgaagtttt cactccacag 
gtcgggcgcc gatatcttga agtcgcagtt 
atacacaggg ttgcagcact ggaacactat 
gctcttgtcg gagatcagat ccgcgtccag 
caactttggt agctgccttc ccaaaaaggg 
ccgtagtggc atcaaaaggt gaccgtgccc 
aaaagccttg atctgcttaa aagccacctg 
gcaagacttg ccggaaaact gattggccgg 
gtcggtgttg gagatctgca ccacatttcg 
gctagactgc tccttcagcg cgcgctgccc 
gtgctcctta tttatcataa tgcttccgtg 
gcagcggtgc agccacaacg cgcagcccgt 
aaacgactgc aggtacgcct gcaggaatcg 
ggtgaaggtc agctgcaacc cgcggtgctc 
cagagcttcc acttggtcag gcagtagttt 
gtacttgtcc atcagcgcgc gcgcagcctc 
cacactcagc gggttcatca ccgtaatttc 
ctcttgcgtc cgcataccac gcgccactgg 
cttacctcct ttgccatgct tgattagcac 



ttaaggaagg taactcacga gaactaatgg 19680 
acattgcttt tagggacaat tttattggtc 19740 
gtgttctggc gggccaagca tcgcagttga 19800 
cagagctttc ataccagctt ttgcttgatt 19860 
tgtggaatca ggctgttgac agctatgatc 19920 
ctgaagatga acttccaaat tactgctttc 19980 
ttaccaaggt aaaacctaaa acaggtcagg 20040 
tttcagataa aaatgaaata agagttggaa 20100 
ccaacctgtg gagaaatttc ctgtactcca 20160 
agtacagtcc ttccaacgta aaaatttctg 20220 
agcgagtggt ggctcccggg ttagtggact 20280 
ttgactatat ggacaacgtc aacccattta 20340 
gctcaatgtt gctgggcaat ggtcgctatg 20400 
tctttgccat taaaaacctc cttctcctgc 20460 
ggaaggatgt taacatggtt ctgcagagct 20520 
ccagcattaa gtttgatagc atttgccttt 20580 
ccgcctccac gcttgaggcc atgcttagaa 20640 
atctctccgc cgccaacatg ctctacccta 20700 
ccatcccctc ccgcaactgg gcggctttcc 20760 
aggaaacccc atcactgggc tcgggctacg 20820 
cctacctaga tggaaccttt tacctcaacc 20880 
actcttctgt cagctggcct ggcaatgacc 20940 
agcgctcagt tgacggggag ggttacaacg 21000 
tcctggtaca aatgctagct aactacaaca 21060 
gctacaagga ccgcatgtac tccttcttta 21120 
tggatgatac taaatacaag gactaccaac 21180 
ctggatttgt tggctacctt gcccccacca 21240 
tcccctatcc gcttataggc aagaccgcag 21300 
gcgatcgcac cctttggcgc atcccattct 21360 
cagacctggg ccaaaacctt ctctacgcca 21420 
aggtggatcc catggacgag cccacccttc 21480 
tccgtgtgca ccggccgcac cgcggcgtca 21540 
cggccggcaa cgccacaaca taaagaagca 21600 
ctccagtgag caggaactga aagccattgt 21660 
gggcacctat gacaagcgct ttccaggctt 21720 
agtcaatacg gccggtcgcg agactggggg 21780 
gcactcaaaa acatgctacc tctttgagcc 21840 
ggtttaccag tttgagtacg agtcactcct 21900 
ccgctgtata acgctggaaa agtccaccca 21960 
tggactattc tgctgcatgt ttctccacgc 22020 
tcacaacccc accatgaacc ttattaccgg 22080 
ggtacagccc accctgcgtc gcaaccagga 22140 
gccctacttc cgcagccaca gtgcgcagat 22200 
aaacatgtaa aaataatgta ctagagacac 22260 
cactctcggg tgattattta cccccaccct 22320 
gttctgccgc gcatcgctat gcgccactgg 22380 
gctccactta aactcaggca caaccatccg 22440 
gctgcgcacc atcaccaacg cgtttagcag 22500 
ggggcctccg ccctgcgcgc gcgagttgcg 22560 
cagcgccggg tggtgcacgc tggccagcac 22620 
gtcctccgcg ttgctcaggg cgaacggagt 22680 
cgcgtgccca ggctttgagt tgcactcgca 22740 
ggtctgggcg ttaggataca gcgcctgcat 22800 
agcctttgcg ccttcagaga agaacatgcc 22860 
acaggccgcg tcgtgcacgc agcaccttgc 22920 
gccccaccgg ttcttcacga tcttggcctt 22980 
gttttcgctc gtcacatcca tttcaatcac 23040 
tagacactta agctcgcctt cgatctcagc 23100 
gggctcgtga tgcttgtagg tcacctctgc 23160 
ccccatcatc gtcacaaagg tcttgttgct 23220 
ctcgttcagc caggtcttgc atacggccgc 23280 
gaagttcgcc tttagatcgt tatccacgtg 23340 
catgcccttc tcccacgcag acacgatcgg 23400 
actttccgct tcgctgggct cttcctcttc 23460 
gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cggtgggttg ctgaaaccca ccatttgtag 23580 
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cgccacatct tctctttctt cctcgctgtc 

gggcttggga gaagggcgct tctttttctt 

ggtcgatggc cgcgggctgg gcgtgcgcgg 

gtcctcggac tcgatacgcc gcctcatccg 

cgacggggac ggggacgaca cgtcctccat 

gcgctcgggg gtggtttcgc gctgctcctc 

gcagaaaaag atcatggagt cagtcgagaa 

cgccaccacc gcctccaccg atgccgccaa 

cccgcttgag gaggaggaag tgattatcga 

cgaggaccgc tcagtaccaa cagaggataa 

cgaggaacaa gtcgggcggg gggacgaaag 

cgtgctgttg aagcatctgc agcgccagtg 

cagcgatgtg cccctcgcca tagcggatgt 

accgcgcgta ccccccaaac gccaagaaaa 

cttctacccc gtatttgccg tgccagaggt 

ctgcaagata cccctatcct gccgtgccaa 

gcggcagggc gctgtcatac ctgatatcgc 

gggtcttgga cgcgacgaga agcgcgcggc 

tgaaagtcac tctggagtgt tggtggaact 

aaaacgcagc atcgaggtca cccactttgc 

catgagcaca gtcatgagtg agctgatcgt 

aaatttgcaa gaacaaacag aggagggcct 

ctggcttcaa acgcgcgagc ctgccgactt 

agtgctcgtt accgtggagc ttgagtgcat 

gcgcaagcta gaggaaacat tgcactacac 

caagatctcc aacgtggagc tctgcaacct 

ccgccttggg caaaacgtgc ttcattccac 

ccgcgactgc gtttacttat ttctatgcta 

gcagtgcttg gaggagtgca acctcaagga 

ggacctatgg acggccttca acgagcgctc 

ccccgaacgc ctgcttaaaa ccctgcaaca 

gttgcagaac tttaggaact ttatcctaga 

tgcacttcct agcgactttg tgcccattaa 

ccactgctac cttctgcagc tagccaacta 

cgtgagcggt gacggtctac tggagtgtca 

ctccctggtt tgcaattcgc agctgcttaa 

gcagggtccc tcgcctgacg aaaagtccgc 

gtggacgtcg gcttaccttc gcaaatttgt 

gttctacgaa gaccaatccc gcccgccaaa 

gggccacatt cttggccaat tgcaagccat 

aaagggacgg ggggtttact tggaccccca 

gccgccgcag ccctatcagc agcagccgcg 

agaagctgca gctgccgccg ccacccacgg 

aggaggtttt ggacgaggag gaggaggaca 

aagcttccga ggtcgaagag gtgtcagacg 

cgccggcgcc ccagaaatcg gcaaccggtt 

cgccgccggc actgcccgtt cgccgaccca 

ccggtaagtc caagcagccg ccgccgttag 

gctcatggcg cgggcacaag aacgccatag 

tctccttcgc ccgccgcttt cttctctacc 

tgcattacta ccgtcatctc tacagcccat 

acagcagcgg ccacacagaa gcaaaggcga 

aaatccacag cggcggcagc agcaggagga 

gtatcgaccc gcgagcttag aaacaggatt 

agcaggggcc aagaacaaga gctgaaaata 

agctgcctgt atcacaaaag cgaagatcag 

ctcttcagta aatactgcgc gctgactctt 

taagcgcgaa aactacgtca tctccagcgg 

gccattatga gcaaggaaat tcccacgccc 

cttgcggctg gagctgccca agactactca 

cacatgatat cccgggtcaa cggaatccgc 

gcggctatta ccaccacacc tcgtaataac 

gtgtaccagg aaagtcccgc tcccaccact 

gttcagatga ctaactcagg ggcgcagctt 

cccgggcagg gtataactca cctgacaatc 

tcggtgagct cctcgcttgg tctccgtccg 



cacgautacc tctggtgatg gcgggcgctc 23640 
cttgggcgca atggccaaat ccgccgccga 23700 
caccagcgcg tcttgtgatg agtcttcctc 23760 
cttttttggg ggcgcccggg gaggcggcgg 23820 
ggttggggga cgtcgcgccg caccgcgtcc 23880 
ttcccgactg gccatttcct tctcctatag 23940 
gaaggacagc ctaaccgccc cctctgagtt 24000 
cgcgcctacc accttccccg tcgaggcacc 24060 
gcaggaccca ggttttgtaa gcgaagacga 24120 
aaagcaagac caggacaacg cagaggcaaa 24180 
gcatggcgac tacctagatg tgggagacga 24240 
cgccattatc tgcgacgcgt tgcaagagcg 24300 
cagccttgcc tacgaacgcc acctattctc 24360 
cggcacatgc gagcccaacc cgcgcctcaa 24 420 
gcttgccacc tatcacatct ttttccaaaa 24480 
ccgcagccga gcggacaagc agctggcctt 24540 
ctcgctcaac gaagtgccaa aaatctttga 24 600 
aaacgctctg caacaggaaa acagcgaaaa 24 660 
cgagggtgac aacgcgcgcc tagccgtact 24720 
ctacccggca cttaacctac cccccaaggt 24780 
gcgccgtgcg cagcccctgg agagggatgc 24840 
acccgcagtt ggcgacgagc agctagcgcg 24 900 
ggaggagcga cgcaaactaa tgatggccgc 24960 
gcagcggttc tttgctgacc cggagatgca 25020 
ctttcgacag ggctacgtac gccaggcctg 25080 
ggtctcctac cttggaattt tgcacgaaaa 25140 
gctcaagggc gaggcgcgcc gcgactacgt 25200 
cacctggcag acggccatgg gcgtttggca 25260 
gctgcagaaa ctgctaaagc aaaacttgaa 25320 
cgtggccgcg cacctggcgg acatcatttt 25380 
gggtctgcca gacttcacca gtcaaagcat 254 40 
gcgctcagga atcttgcccg ccacctgctg 25500 
gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccttgcctac cactctgaca taatggaaga 25620 
ctgtcgctgc aacctatgca ccccgcaccg 25680 
cgaaagtcaa attatcggta cctttgagct 25740 
ggctccgggg ttgaaactca ctccggggct 25800 
acctgaggac taccacgccc acgagattag 25860 
tgcggagctt accgcctgcg tcattaccca 25920 
caacaaagcc cgccaagagt ttctgctacg 25980 
gtccggcgag gagctcaacc caatcccccc 2 6040 
ggcccttgct tcccaggatg gcacccaaaa 26100 
acgaggagga atactgggac agtcaggcag 26160 
tgatggaaga ctgggagagc ctagacgagg 26220 
aaacaccgtc accctcggtc gcattcccct 26280 
ccagcatggc tacaacctcc gctcctcagg 26340 
accgtagatg ggacaccact ggaaccaggg 26400 
cccaagagca acaacagcgc caaggctacc 264 60 
ttgcttgctt gcaagactgt gggggcaaca 26520 
atcacggcgt ggccttcccc cgtaacatcc 26580 
actgcaccgg cggcagcggc agcggcagca 26640 
ccggatagca agactctgac aaagcccaag 26700 
ggagcgctgc gtctggcgcc caacgaaccc 26760 
tttcccactc tgtatgctat atttcaacag 26820 
aaaaacaggt ctctgcgatc cctcacccgc 26880 
cttcggcgca cgctggaaga cgcggaggct 26940 
aaggactagt ttcgcgccct ttctcaaatt 27000 
ccacacccgg cgccagcacc tgtcgtcagc 27060 
tacatgtgga gttaccagcc acaaatggga 27120 
acccgaataa actacatgag cgcgggaccc 27180 
gcccaccgaa accgaattct cttggaacag 27240 
cttaatcccc gtagttggcc cgctgccctg 27300 
gtggtacttc ccagagacgc ccaggccgaa 27360 
gcgggcggct ttcgtcacag ggtgcggtcg 27420 
agagggcgag gtattcagct caacgacgag 27480 
gacgggacat ttcagatcgg cggcgccggc 27540 
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cgtccttcat 
cgctctggag 
aaccccttct 
gtaaaggact 
ctgaaacacc 
tgctactttg 
gcccagggag 
gagcgggaca 
catcaagatc 
ctggggctcc 
gcgaacctta 
agacggagtg 
caccctcctt 
gcctgaccgt 
gaggtgagct 
tgaacaattc 
cctggatgtc 
tacagcgacc 
ttacatctac 
gcatgtggtg 
gctgcctaaa 
caaacaatga 
tatgattaaa 
acgcagggca 
gcaccagtgc 
taataccacc 
catggtggga 
tcactcacct 
caaagatctt 
cagttagcaa 
ggtattgcag 
cctcctgttc 
gaccgtctga 
ctgtgccttt 
gggtactctc 
aaatgggcaa 
ctgtgagccc 
tcacagttac 
acacactcac 
ccacccaagg 
tcaccaccac 
ctggtagctt 
taaagtacgg 
gtccaggtgt 
ttgattcaca 
acagacgcct 
gactaggaca 
aaggccttta 
ctgccaaggg 
aatttggttc 
tagaatttga 
gcacaggtgc 
cagctccatc 
taacaaaatg 
tggctccaat 
gagtgctact 
ttactgaagg 
caaaatctca 
acaaaactaa 
caactccaag 
atgaaatatt 
tttgtgttat 
ttcagtagta 
cacagaaccc 
tctccccggc 
atattccaca 



tcacgcctcg 
gcattggaac 
cgggacctcc 
cggcggacgg 
tggtccactg 
aattgcccga 
agcttgcccg 
ggggaccctg 
tttgttgcca 
tatcgccatc 
cctggtactt 
agtctacgag 
acctgccggg 
aaaccagact 
tagaaaaccc 
aagcaactct 
agcatctgac 
caccctaaca 
cacaaataca 
gttctccata 
gcgcaaacgc 
tggaatccat 
tgagatctag 
gcggccgagc 
aaaaggggta 
ggacaccgcc 
gaaaagccca 
tgtcaaggac 
attcccttta 
atttctgtcc 
cttcctcctg 
ctgtccatcc 
agataccttc 
tcttactcct 
tttgcgccta 
cggcctctct 
acctctcaaa 
ctcagaagcc 
catgcaatca 
acccctcaca 
cgatagcagt 
gggcattgac 
ggctcctttg 
gactattaat 
aggcaatatg 
tatacttgat 
gggccctctt 
cttgtttaca 
gttgatgttt 
acctaatgca 
ttcaaacaag 
cattacagta 
tcctaactgt 
tggcagtcaa 
atctggaaca 
aaacaattcc 
cacagcctat 
cggtaaaact 
acctgtaaca 
tgcatactct 
tgccacatcc 
gtttcaacgt 
tagccccacc 
tagtattcaa 
tggccttaaa 
cggtttcctg 



ucaggcaatc 
tctgcaattt 
cggccactat 
ctacgactga 
tcgccgccac 
ggatcatatc 
tagcctgatt 
tgttctcact 
tctctgtgct 
ctgtaaacgc 
ttaacatctc 
agaacctctc 
aacgtacgag 
ttttccggac 
ttagggtatt 
acgggctatt 
tttggccagc 
gagatgacca 
ccccaagttt 
gcgcttatgt 
gcccgaccac 
agattggacg 
aaatggacgg 
aacagcgcat 
tcttttgtct 
ttagctacaa 
ttaccataac 
ctgaggatct 
actaataaaa 
agtttattca 
gctgcaaact 
gcacccacta 
aaccccgtgt 
ccctttgtat 
tccgaacctc 
ctggacgagg 
aaaaccaagt 
ctaactgtgg 
caggccccgc 
gtgtcagaag 
acccttacta 
ttgaaagagc 
catgtaacag 
aatacttcct 
caacttaatg 
gttagttatc 
tttataaact 
gcttcaaaca 
gacgctacag 
ccaaacacaa 
gctatggttc 
ggaaacaaaa 
agactaaatg 
atacttgcta 
gttcaaagtg 
ttcctggacc 
acaaacgctg 
gccaaaagta 
ctaaccatta 
atgtcatttt 
tcttacactt 
gtttattttt 
accacatagc 
cctgccacct 
aagcatcata 
tcgagccaaa 



ctaactctgc 

attgaggagt 

ccggatcaat 

atgttaagtg 

aagtgctttg 

gagggcccgg 

cgggagttta 

gtgatttgca 

gagtataata 

caccgtcttc 

tccctctgtg 

cgagctcagc 

tgcgtcaccg 

agacctcaat 

aggccaaagg 

ctaattcagg 

acctgtcccg 

acacaaccaa 

ctgcctttgt 

ttgtatgcct 

ccatctatag 

gactgaaaca 

aattattaca 

gaatcaagag 

ggtaaagcag 

gttgccaacc 

tcagcactcg 

ctgcaccctt 

aaaaataata 

gcagcacctc 

ttctccacaa 

tcttcatgtt 

atccatatga 

cccccaatgg 

tagttacctc 

ccggcaacct 

caaacataaa 

ctgccgccgc 

taaccgtgca 

gaaagctagc 

tcactgcctc 

ccatttatac 

acgacctaaa 

tgcaaactaa 

tagcaggagg 

cgtttgatgc 

cagcccacaa 

attccaaaaa 

ccatagccat 

atcccctcaa 

ctaaactagg 

ataatgataa 

cagagaaaga 

cagtttcagt 

ctcatcttat 

cagaatattg 

ttggatttat 

acattgtcag 

cactaaacgg 

catgggactg 

tttcatacat 

caattgcaga 

ttatacagat 

ccctcccaac 

tcatgggtaa 

cgctcatcag 



agacctcgtc 
ttgtgccatc 
ttattcctaa 
gagaggcaga 
cccgcgactc 
cgcacggcgt 
cccagcgccc 
actgtcctaa 
aatacagaaa 
acccgcccaa 
atttacaaca 
tactccatca 
gccgctgcac 
aactctgttt 
cgcagctact 
tttctctaga 
cggatttgtt 
cgcggccgcc 
caataactgg 
tattattatg 
tcccatcatt 
catgttcttt 
gagcagcgcc 
ctccaagaca 
gccaaagtca 
aagcgtcaga 
gtagaaaccg 
attaagaccc 
aagcatcact 
cttgccctcc 
tctaaatgga 
gttgcagatg 
cacggaaacc 
gtttcaagag 
caatggcatg 
tacctcccaa 
cctggaaata 
acctctaatg 
cgactccaaa 
cctgcaaaca 
accccctcta 
acaaaatgga 
cactttgacc 
agttactgga 
actaaggatt 
tcaaaaccaa 
cttggatatt 
gcttgaggtt 
taatgcagga 
aacaaaaatt 
aactggcctt 
gctaactttg 
tgctaaactc 
tttggctgtt 
tataagattt 
gaactttaga 
gcctaaccta 
tcaagtttac 
tacacaggaa 
gtctggccac 
tgcccaagaa 
aaatttcaag 
caccgtacct 
acacagagta 
cagacatatt 
tgatattaat 



ctctgagccg 
ggtctacttt 
ctttgacgcg 
gcaactgcgc 
cggtgagttt 
ccggcttacc 
cctgctagtt 
ccttggatta 
ttaaaatata 
gcaaaccaag 
gtttcaaccc 
gaaaaaacac 
cacacctacc 
accagaacag 
gtggggttta 
agtcaggctt 
ccagtccaac 
gctaccggac 
gataacttgg 
tggctcatct 
gtgctacacc 
tctcttacag 
tgctagaaag 
tggttaactt 
cctacgacag 
aattggtggt 
aaggctgcat 
tgtgcggtct 
tacttaaaat 
tcccagctct 
atgtcagttt 
aagcgcgcaa 
ggtcctccaa 
agtccccctg 
cttgcgctca 
aatgtaacca 
tctgcacccc 
gtcgcgggca 
cttagcattg 
tcaggccccc 
actactgcca 
aaactaggac 
gtagcaactg 
gccttgggtt 
gattctcaaa 
ctaaatctaa 
aactacaaca 
aacctaagca 
gatgggcttg 
ggccatggcc 
agttttgaca 
tggaccacac 
actttggtct 
aaaggcagtt 
gacgaaaatg 
aatggagatc 
tcagcttatc 
ttaaacggag 
acaggagaca 
aactacatta 
taaagaatcg 
tcatttttca 
taatcaaact 
cacagtcctt 
cttaggtgtt 
aaactccccg 



27600 

27660 

27720 

27780 

27840 

27900 

27960 

28020 

28080 

28140 

28200 

28260 

28320 

28380 

28440 

28500 

28560 

28620 

28680 

28740 

28800 

28860 

28920 

28980 

29040 

29100 

29160 

29220 

29280 

29340 

29400 

29460 

29520 

29580 

29640 

29700 

29760 

29820 

29880 

29940 

30000 

30060 

30120 

30180 

30240 

30300 

30360 

30420 

30480 

30540 

30600 

30660 

30720 

30780 

30840 

30900 

30960 

31020 

31080 

31140 

31200 

31260 

31320 

31380 

31440 

31500 
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ggcagctcac ttaagttcat gtcgctgtcc agctgctgag ccacaggctg ctgtccaact 31560 
tgcggttgct taacgggcgg cgaaggagaa gtccacgcct acatgggggt agagtcataa 31620 
tcgtgcatca ggatagggcg gtggtgctgc agcagcgcgc gaataaactg ctgccgccgc 31680 
cgctccgtcc tgcaggaata caacatggca gtggtctcct cagcgatgat tcgcaccgcc 31740 
cgcagcataa ggcgccttgt cctccgggca cagcagcgca ccctgatctc acttaaatca 31800 
gcacagtaac tgcagcacag caccacaata ttgttcaaaa tcccacagtg caaggcgctg 31860 
tatccaaagc tcatggcggg gaccacagaa cccacgtggc catcatacca caagcgcagg 31920 
tagattaagt ggcgacccct cataaacacg ctggacataa acattacctc ttttggcatg 31980 
ttgtaattca ccacctcccg gtaccatata aacctctgat taaacatggc gccatccacc 32040 
accatcctaa accagctggc caaaacctgc ccgccggcta tacactgcag ggaaccggga 32100 
ctggaacaat gacagtggag agcccaggac tcgtaaccat ggatcatcat gctcgtcatg 32160 
atatcaatgt tggcacaaca caggcacacg tgcatacact tcctcaggat tacaagctcc 32220 
tcccgcgtta gaaccatatc ccagggaaca acccattcct gaatcagcgt aaatcccaca 32280 
ctgcagggaa gacctcgcac gtaactcacg ttgtgcattg tcaaagtgtt acattcgggc 32340 
agcagcggat gatcctccag tatggtagcg cgggtttctg tctcaaaagg aggtagacga 32400 
tccctactgt acggagtgcg ccgagacaac cgagatcgtg ttggtcgtag tgtcatgcca 324 60 
aatggaacgc cggacgtagt catatttcct gaagcaaaac caggtgcggg cgtgacaaac 32520 
agatctgcgt ctccggtctc gccgcttaga tcgctctgtg tagtagttgt agtatatcca 32580 
ctctctcaaa gcatccaggc gccccctggc ttcgggttct atgtaaactc cttcatgcgc 32640 
cgctgccctg ataacatcca ccaccgcaga ataagccaca cccagccaac ctacacattc 32700 
gttctgcgag tcacacacgg gaggagcggg aagagctgga agaaccatgt tttttttttt 327 60 
attccaaaag attatccaaa acctcaaaat gaagatctat taagtgaacg cgctcccctc 32820 
cggtggcgtg gtcaaactct acagccaaag aacagataat ggcatttgta agatgttgca 32880 
caatggcttc caaaaggcaa acggccctca cgtccaagtg gacgtaaagg ctaaaccctt 32940 
cagggtgaat ctcctctata aacattccag caccttcaac catgcccaaa taattctcat 33000 
ctcgccacct tctcaatata tctctaagca aatcccgaat attaagtccg gccattgtaa 33060 
aaatctgctc cagagcgccc tccaccttca gcctcaagca gcgaatcatg attgcaaaaa 33120 
ttcaggttcc tcacagacct gtataagatt caaaagcgga acaittaacaa aaataccgcg 33180 
atcccgtagg tcccttcgca gggccagctg aacataatcg tgcaggtctg cacggaccag 33240 
cgcggccact tccccgccag gaaccttgac aaaagaaccc acactgatta tgacacgcat 33300 
actcggagct atgctaacca gcgtagcccc gatgtaagct ttgttgcatg ggcggcgata 33360 
taaaatgcaa ggtgctgctc aaaaaatcag gcaaagcctc gcgcaaaaaa gaaagcacat 33420 
cgtagtcatg ctcatgcaga taaaggcagg taagctccgg aaccaccaca gaaaaagaca 33480 
ccatttttct ctcaaacatg tctgcgggtt tctgcataaa cacaaaataa aataacaaaa 33540 
aaacatttaa acattagaag cctgtcttac aacaggaaaa acaaccctta taagcataag 33600 
acggactacg gccatgccgg cgtgaccgta aaaaaactgg tcaccgtgat taaaaagcac 33660 
caccgacagc tcctcggtca tgtccggagt cataatgtaa gactcggtaa acacatcagg 33720 
ttgattcatc ggtcagtgct aaaaagcgac cgaaatagcc cgggggaata catacccgca 33780 
ggcgtagaga caacattaca gcccccatag gaggtataac aaaattaata ggagagaaaa 33840 
acacataaac acctgaaaaa ccctcctgcc taggcaaaat agcaccctcc cgctccagaa 33900 
caacatacag cgcttcacag cggcagccta acagtcagcc ttaccagtaa aaaagaaaac 33960 
ctattaaaaa aacaccactc gacacggcac cagctcaatc agtcacagtg taaaaaaggg 34020 
ccaagtgcgt tacactgcag caggtgtgac tcagccatgg cacctctgca gcctgggtac 34080 
cctgcttggg gcatggcccc ttatagctgg gcggggcgtg ggggctctgt aggagtggca 34140 
gcgacctcag tgtttgtctt tgctctgaag agccctccag gtgcttgatc ccaccttttc 34200 
ccagcaggaa cactcctgcc tgccttacca cctgtcctgg ctgatggcct gttcctgcct 34260 
cctttgcccc ctgcccagac tcccatgttc ctggacttgt ggcttcctcc aaccaggggc 34320 
tctcaagcct ccatacctgg tcccacctct ccaggccgtg ggagggaggt tgaggagggt 34380 
ggagggcatc tggttggggg cagcctgggt gttcccctcc catcccctcc ctgggcctcc 34440 
caggccccct ctactcttga gcaatgctct tgagagcttc ctgcctggct cttaacccag 34500 
ggcaagccct ggaagggcag acccaggaca ctctcaccac ctccttacct tttcccctgg 34560 
aaaaatcttc tgtatacttc ccattttaag aaaactacaa ttcccaacac atacaagtta 34620 
ctccgcccta aaacctacgt cacccgcccc gttcccacgc cccgcgccac gtcacaaact 34680 
ccaccccctc attatcatat tggcttcaat ccaaaataag gtatattatt gatgatg 34737 

<210> 16 
<211> 36114 
<212> DNA 

<213> Adenovirus siabgroup C 
<400> 16 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60 
ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 
gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 
taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 
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agtgaaatct gaataatttt gtgttactca 

gactttgacc gtttacgtgg agactcgccc 

cgggtcaaag ttggcgtttt attattatag 

tgagttcctc aagaggccac tcttgagtgc 

tccgacaccg ggactgaaaa tgagacatga 

ccattttgaa ccacctaccc ttcacgaact 

tcccaacgag gaggcggttt cgcagatttt 

agggattgac ttactcactt ttccgccggc 

ccggcagccc gagcagccgg agcagagagc 

tccacccagt gacgacgagg atgaagaggg 

ccccgggcac ggttgcaggt cttgtcatta 

tatgtgttcg ctttgctata tgaggacctg 

atgggcagtg ggtgatagag tggtgggttt 

gttttgtggt ttaaagaatt ttgtattgtg 

gagcctgagc ccgagccaga accggagcct 

cctgctatcc tgagacgccc gacatcacct 

agctgtgact ccggtccttc taacacacct 

cccattaaac cagttgccgt gagagttggt 

gacttgctta acgagcctgg gcaacctttg 

ggtgtaaacc tgtgattgcg tgtgtggtta 

agtttaataa agggtgagat aatgtttaac 

aaagggtata taatgcgccg tgggctaatc 

gagtgtttgg aagatttttc tgctgtgcgt 

tcttggtttt ggaggtttct gtggggctca 

gaggattaca agtgggaatt tgaagagctt 

ttgaatctgg gtcaccaggc gcttttccaa 

acaccggggc gcgctgcggc tgctgttgct 

gaagaaaccc atctgagcgg ggggtacctg 

gcggttgtga gacacaagaa tcgcctgcta 

ccgacggagg agcagcagca gcagcaggag 

ccatggaacc cgagagccgg cctggaccct 

tgtatccaga actgagacgc attttgacaa 

taaagaggga gcggggggct tgtgaggcta 

taatgaccag acaccgtcct gagtgtatta 

atgagcttga tctgctggcg cagaagtatt 

agccagggga tgattttgag gaggctatta 

attgcaagta caagatcagc aaacttgtaa 

acggggccga ggtggagata gatacggagg 

atatgtggcc gggggtgctt ggcatggacg 

gccccaattt tagcggtacg gttttcctgg 

gcttctatgg gtttaacaat acctgtgtgg 

gtgcctttta ctgctgctgg aagggggtgg 

agaaatgcct ctttgaaagg tgtaccttgg 

gccacaatgt ggcctccgac tgtggttgct 

agcataacat ggtatgtggc aactgcgagg 

acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata . ctgacccgct 

tgttcctacc ttaccaatgc aatttgagtc 

tgtccaaggt gaacctgaac ggggtgtttg 

ggtacgatga gacccgcacc aggtgcagac 

accagcctgt gatgctggat gtgaccgagg 

gcacccgcgc tgagtttggc tctagcgatg 

ggcgtggctt aagggtggga aagaatatat 

gttttgcagc agccgccgcc gccatgagca 

catatttgac aacgcgcatg cccccatggg 

gcattgatgg tcgccccgtc ctgcccgcaa 

ctggaacgcc gttggagact gcagcctccg 

gcgggattgt gactgacttt gctttcctga 

catccgcccg cgatgacaag ttgacggctc 

aacttaatgt cgtttctcag cagctgttgg 

cttcctcccc tcccaatgcg gtttaaaaca 

ggatcaagca agtgtcttgc tgtctttatt 

accagcggtc tcggtcgttg agggtcctgt 

tctggatgtt cagatacatg ggcataagcc 

gagcttcatg ctgcggggtg gtgttgtaga 

ggtgcctaaa aatgtctttc agtagcaagc 



tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 420 
tcagctgacg tgtagtgtat ttatacccgg 480 
cagcgagtag agttttctcc tccgagccgc 540 
ggtactggct gataatcttc cacctcctag 600 
gtatgattta gacgtgacgg cccccgaaga 660 
tcccgactct gtaatgttgg cggtgcagga 720 
gcccggttct ccggagccgc ctcacctttc 780 
cttgggtccg gtttgccacg aggctggctt 840 
tgaggagttt gtgttagatt atgtggagca 900 
tcaccggagg aatacggggg acccagatat 960 
tggcatgttt gtctacagta agtgaaaatt 1020 
ggtgtggtaa tttttttttt aatttttaca 1080 
atttttttaa aaggtcctgt gtctgaacct 1140 
gcaagaccta cccgccgtcc taaaatggcg 1200 
gtgtctagag aatgcaatag tagtacggat 1260 
cctgagatac acccggtggt cccgctgtgc 1320 
gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgagct gtaaacgccc caggccataa 1440 
acgcctttgt ttgctgaatg agttgatgta 1500 
ttgcatggcg tgttaaatgg ggcggggctt 1560 
ttggttacat ctgacctcat ggaggcttgg 1620 
aactcgctgg aacagagctc taacagtacc 1680 
tcccaggcaa agttagtctg cagaattaag 1740 
ttgaaatcct gtggtgagct gtttgattct 1800 
gagaaggtca tcaagacttt ggatttttcc 1860 
tttttgagtt ttataaagga taaatggagc 1920 
ctggattttc tggccatgca tctgtggaga 1980 
ctgttgtctt ccgtccgccc ggcgataata 2040 
gaagccaggc ggcggcggca ggagcagagc 2100 
cgggaatgaa tgttgtacag gtggctgaac 2160 
ttacagagga tgggcagggg ctaaaggggg 2220 
cagaggaggc taggaatcta gcttttagct 2280 
cttttcaaca gatcaaggat aattgcgcta 2340 
ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 24 60 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
gggtggttat tatgaatgta aggtttactg 2640 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 2760 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 2940 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgca tttgggtaac aggagggggg 3120 
acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 3240 
cctgcgagtg tggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 3540 
ccggggtgcg tcagaatgtg atgggctcca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
ttttggcaca attggattct ttgacccggg 3840 
atctgcgcca gcaggtttct gccctgaagg 3900 
taaataaaaa accagactct gtttggattt 3960 
taggggtttt gcgcgcgcgg taggcccggg 4020 
gtattttttc caggacgtgg taaaggtgac 4080 
cgtctctggg gtggaggtag caccactgca 4140 
tgatccagtc gtagcaggag cgctgggcgt 4200 
tgattgccag gggcaggccc ttggtgtaag 4260 
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tgtttacaaa gcggttaagc tgggatgggt gcatacgtgg ggatatgaga tgcatcttgg 4320 
actgtatttt taggttggct atgttcccag ccatatccct ccggggattc atgttgtgca 4380 
gaaccaccag cacagtgtat ccggtgcact tgggaaattt gtcatgtagc ttagaaggaa 4440 
atgcgtggaa gaacttggag acgcccttgt gacctccaag attttccatg cattcgtcca 4500 
raatgatggc aatgggccca cgggcggcgg cctgggcgaa gatatttctg ggatcactaa 4560 
cgtcatagtt gtgttccagg atgagatcgt cataggccat ttttacaaag cgcgggcgga 4 620 
gggtgccaga ctgcggtata atggttccat ccggcccagg ggcgtagtta ccctcacaga 4 680 
tttgcatttc ccacgctttg agttcagatg gggggatcat gtctacctgc ggggcgatga 4740 
agaaaacggt ttccggggta ggggagatca gctgggaaga aagcaggttc ctgagcagct 4800 
gcgacttacc gcagccggtg ggcccgtaaa tcacacctat taccgggtgc aactggtagt 4860 
taagagagct gcagctgccg tcatccctga gcaggggggc cacttcgtta agcatgtccc 4 920 
tgactcgcat gttttccctg accaaatccg ccagaaggcg ctcgccgccc agcgatagca 4980 
gttcttgcaa ggaagcaaag tttttcaacg gtttgagacc gtccgccgta ggcatgcttt 5040 
tgagcgtttg accaagcagt tccaggcggt cccacagctc ggtcacctgc tctacggcat 5100 
ctcgatccag catatctcct cgtttcgcgg gttggggcgg ctttcgctgt acggcagtag 5160 
tcggtgctcg tccagacggg ccagggtcat gtctttccac gggcgcaggg tcctcgtcag 5220 
cgtagtctgg gtcacggtga aggggtgcgc tccgggctgc gcgctggcca gggtgcgctt 5280 
gaggctggtc ctgctggtgc tgaagcgctg ccggtcttcg ccctgcgcgt cggccaggta 534 0 
gcatttgacc atggtgtcat agtccagccc ctccgcggcg tggcccttgg cgcgcagctt 5400 
gcccttggag gaggcgccgc acgaggggca gtgcagactt ttgagggcgt agagcttggg 5460 
cgcgagaaat accgattccg gggagtaggc atccgcgccg caggccccgc agacggtctc 5520 
gcattccacg agccaggtga gctctggccg ttcggggtca aaaaccaggt ttcccccatg 5580 
ctttttgatg cgtttcttac ctctggtttc catgagccgg tgtccacgct cggtgacgaa 5640 
aaggctgtcc gtgtccccgt atacagactt gagaggcctg tcctcgagcg gtgttccgcg 5700 
gtcctcctcg tatagaaact cggaccactc tgagacaaag gctcgcgtcc aggccagcac 5760 
gaaggaggct aagtgggagg ggtagcggtc gttgtccact agggggtcca ctcgctccag 5820 
ggtgtgaaga cacatgtcgc cctcttcggc atcaaggaag gtgattggtt tgtaggtgta 5880 
ggccacgtga ccgggtgttc ctgaaggggg gctataaaag ggggtggggg cgcgttcgtc 5940 
ctcactctct tccgcatcgc tgtctgcgag ggccagctgt tggggtgagt actccctctg 6000 
aaaagcgggc atgacttctg cgctaagatt gtcagtttcc aaaaacgagg aggatttgat 6060 
attcacctgg cccgcggtga tgcctttgag ggtggccgca tccatctggt cagaaaagac 6120 
aatctttttg ttgtcaagct tggtggcaaa cgacccgtag agggcgttgg acagcaactt 6180 
ggcgatggag cgcagggttt ggtttttgtc gcgatcggcg cgctccttgg ccgcgatgtt 6240 
tagctgcacg tattcgcgcg caacgcaccg ccattcggga aagacggtgg tgcgctcgtc 6300 
gggcaccagg tgcacgcgcc aaccgcggtt gtgcagggtg acaaggtcaa cgctggtggc 6360 
tacctctccg cgtaggcgct cgttggtcca gcagaggcgg ccgcccttgc gcgagcagaa 6420 
tggcggtagg gggtctagct gcgtctcgtc cggggggtct gcgtccacgg taaagacccc 6480 
gggcagcagg cgcgcgtcga agtagtctat cttgcatcct tgcaagtcta gcgcctgctg 6540 
ccatgcgcgg gcggcaagcg cgcgctcgta tgggttgagt gggggacccc atggcatggg 6600 
gtgggtgagc gcggaggcgt acatgccgca aatgtcgtaa acgtagaggg gctctctgag 6660 
tattccaaga tatgtagggt agcatcttcc accgcggatg ctggcgcgca cgtaatcgta 6720 
tagttcgtgc gagggagcga ggaggtcggg accgaggttg ctacgggcgg gctgctctgc 6780 
tcggaagact atctgcctga agatggcatg tgagttggat gatatggttg gacgctggaa 6840 
gacgttgaag ctggcgtctg tgagacctac cgcgtcacgc acgaaggagg cgtaggagtc 6900 
gcgcagcttg ttgaccagct cggcggtgac ctgcacgtct agggcgcagt agtccagggt 6960 
ttccttgatg atgtcatact tatcctgtcc cttttttttc cacagctcgc ggttgaggac 7020 
aaactcttcg cggtctttcc agtactcttg gatcggaaac ccgtcggcct ccgaacggta 7080 
agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 7140 
tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 7260 
gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 7320 
gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 7380 
ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 7440 
gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc tcgatggaag gcaatttttt 7500 
aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 7560 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 7620 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 7680 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 7740 
cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 7800 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 7860 
ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 7920 
gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 7 980 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 8040 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 8100 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 8160 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 8220 
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tcggagcttg atgacaacat cgcgcagatg 
cgtcaggtca ggcgggagct cctgcaggtt 
tagatccagg tgatacctaa tttccagggg 
gaggccgcat ccccgcggcg cgactacggt 
gtccttggat gatgcatcta aaagcggtga 
tccggacccg ccgggagagg gggcaggggc 
tgctgcgcgc gtaggttgct ggcgaacgcg 
cgcctctgcg tgaagacgac gggcccggtg 
tcaatttcgg tgtcgttgac ggcggcctgg 
tcttgatagg cgatctcggc catgaactgc 
ccggctcgct ccacggtggc ggcgaggtcg 
gcgttgaggc ctccctcgtt ccagacgcgg 
gcgcgcatga ccacctgcgc gagattgagc 
cgcaggcgct gaaagaggta gttgagggtg 
ataacccagc gtcgcaacgt ggattcgttg 
gcctcgtaga agtccacggc gaagttgaaa 
tcctcctcca gaagacggat gagctcggcg 
acaggggcct cttcttcttc ttcaatctcc 
tctggcggcg gtgggggagg ggggacacgg 
acaaagcgct cgatcatctc cccgcggcga 
ttctcgcggg ggcgcagttg gaagacgccg 
gggctgccat gcggcaggga tacggcgcta 
actccgccgc cgagggacct gagcgagtcc 
aaggcgtcta accagtcaca gtcgcaaggt 
cggcggtcgg ggttgtttct ggcggaggtg 
ttgagacggc ggatggtcga cagaagcacc 
aggcggtcgg ccatgcccca ggcttcgttt 
tgcatgagcc tttctaccgg cacttcttct 
tctatcgctg cggcggcggc ggagtttggc 
gtgaccccga agcccctcat cggctgaagc 
aatatggcct gctgcacctg cgtgagggta 
tggtatgcgc ccgtgttgat ggtgtaagtg 
tggtgacccg gctgcgagag ctcggtgtac 
acgtagtcgt tgcaagtccg caccaggtac 
tggcggtaga ggggccagcg tagggtggcc 
aggcgatgat atccgtagat gtacctggac 
gcgcgcggaa agtcgcggac gcggttccag 
gtcgggacgc tctggccggt caggcgcgcg 
gagcctgtaa gcgggcactc ttccgtggtc 
ggacgaccgg ggttcgagcc ccgtatccgg 
cgcgtgtcga acccaggtgt gcgacgtcag 
tccaggcgcg gcggctgctg cgctagcttt 
gttaggctgg aaagcgaaag cattaagtgg 
caagggttga gtcgcgggac ccccggttcg 
ggggtttgcc tccccgtcat gcaagacccc 
gccccttttt tgcttttccc agatgcatcc 
gcagcggcaa gagcaagagc agcggcagac 
gtcaggaggg gcgacatccg cggttgacgc 
gcgccgggcc cggcactacc tggacttgga 
gccctctcct gagcggtacc caagggtgca 
gccgcggcag aacctgtttc gcgaccgcga 
aaagttccac gcagggcgcg agctgcggca 
ggaggacttt gagcccgacg cgcgaaccgg 
cgccgacctg gtaaccgcat acgagcagac 
ctttaacaac cacgtgcgta cgcttgtggc 
tctgtgggac tttgtaagcg cgctggagca 
gctgttcctt atagtgcagc acagcaggga 
catagtagag cccgagggcc gctggctgct 
ggtgcaggag cgcagcttga gcctggctga 
tagcctgggc aagttttacg cccgcaagat 
ggaggtaaag atcgaggggt tctacatgcg 
cgacctgggc gtttatcgca acgagcgcat 
cgagctcagc gaccgcgagc tgatgcacag 
cggcgataga gaggccgagt cctactttga 
ccgacgcgcc ctggaggcag ctggggccgg 
tggcaacgtc ggcggcgtgg aggaatatga 



ggagctgtcc atggtctgga gctcccgcgg 8280 
tacctcgcat agacgggtca gggcgcgggc 8340 
ctggttggtg gcggcgtcga tggcttgcaa 8400 
accgcgcggc gggcggtggg ccgcgggggt 8460 
cgcgggcgag cccccggagg tagggggggc 8520 
acgtcggcgc cgcgcgcggg caggagctgg 8580 
acgacgcggc ggttgatctc ctgaatctgg 864 0 
agcttgagcc tgaaagagag ttcgacagaa 8700 
cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcgatctctt cctcctggag atctccgcgt 8820 
ttggaaatgc gggccatgag ctgcgagaag 8880 
ctgtagacca cgcccccttc ggcatcgcgg 8940 
tccacgtgcc gggcgaagac ggcgtagttt 9000 
gtggcggtgt gttctgccac gaagaagtac 9060 
atatccccca aggcctcaag gcgctccatg 9120 
aactgggagt tgcgcgccga cacggttaac 9180 
acagtgtcgc gcacctcgcg ctcaaaggct 9240 
tcttccataa gggcctcccc ttcttcttct 9300 
cggcgacgac ggcgcaccgg gaggcggtcg 9360 
cggcgcatgg tctcggtgac ggcgcggccg 9420 
cccgtcatgt cccggttatg ggttggcggg 9480 
acgatgcatc tcaacaattg ttgtgtaggt 9540 
gcatcgaccg gatcggaaaa cctctcgaga 9600 
aggctgagca ccgtggcggg cggcagcggg 9660 
ctgctgatga tgtaattaaa gtaggcggtc 9720 
atgtccttgg gtccggcctg ctgaatgcgc 9780 
tgacatcggc gcaggtcttt gtagtagtct 9840 
tctccttcct cttgtcctgc atctcttgca 9900 
cgtaggtggc gccctcttcc tcccatgcgt 9960 
agggctaggt cggcgacaac gcgctcggct 10020 
gactggaagt catccatgtc cacaaagcgg 10080 
cagttggcca taacggacca gttaacggtc 10140 
ctgagacgcg agtaagccct cgagtcaaat 10200 
tggtatccca ccaaaaagtg cggcggcggc 10260 
ggggctccgg gggcgagatc ttccaacata 10320 
atccaggtga tgccggcggc ggtggtggag 10380 
atgttgcgca gcggcaaaaa gtgctccatg 10440 
caatcgttga cgctctagcg tgcaaaagga 10500 
tggtggataa attcgcaagg gtatcatggc 10560 
ccgtccgccg tgatccatgc ggttaccgcc 10620 
acaacggggg agtgctcctt ttggcttcct 10680 
tttggccact ggccgcgcgc agcgtaagcg 10740 
ctcgctccct gtagccggag ggttattttc 10800 
agtctcggac cggccggact gcggcgaacg 10860 
gcttgcaaat tcctccggaa acagggacga 10920 
ggtgctgcgg cagatgcgcc cccctcctca 10980 
atgcagggca ccctcccctc ctcctaccgc 1104 0 
ggcagcagat ggtgattacg aacccccgcg 11100 
ggagggcgag ggcctggcgc ggctaggagc 11160 
gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gggagaggag cccgaggaga tgcgggatcg 11280 
tggcctgaat cgcgagcggt tgctgcgcga 11340 
gattagtccc gcgcgcgcac acgtggcggc 11400 
ggtgaaccag gagattaact ttcaaaaaag 11460 
gcgcgaggag gtggctatag gactgatgca 11520 
aaacccaaat agcaagccgc tcatggcgca 11580 
caacgaggca ttcagggatg cgctgctaaa 11640 
cgatttgata aacatcctgc agagcatagt 11700 
caaggtggcc gccatcaact attccatgct 11760 
ataccatacc ccttacgttc ccatagacaa 11820 
catggcgctg aaggtgctta ccttgagcga 11880 
ccacaaggcc gtgagcgtga gccggcggcg 11940 
cctgcaaagg gccctggctg gcacgggcag 12000 
cgcgggcgct gacctgcgct gggccccaag 12060 
acctgggctg gcggtggcac ccgcgcgcgc 12120 
cgaggacgat gagtacgagc cagaggacgg 12180 
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cgagtactaa gcggtgatgt ttctgatcag 
cgggcggcgc tgcagagcca gccgtccggc 
atggaccgca tcatgtcgct gactgcgcgc 
gccaaccggc tctccgcaat tctggaagcg 
gagaaggtgc tggcgatcgt aaacgcgctg 
gccggcctgg tctacgacgc gctgcttcag 
cagaccaacc tggaccggct ggtgggggat 
gcgcagcagc agggcaacct gggctccatg 
cccgccaacg tgccgcgggg acaggaggac 
atggtgactg agacaccgca aagtgaggtg 
accagtagac aaggcctgca gaccgtaaac 
ctgtgggggg tgcgggctcc cacaggcgac 
aactcgcgcc tgttgctgct gctaatagcg 
gacacatacc taggtcactt gctgacactg 
gacgagcata ctttccagga gattacaagt 
ggcagcctgg aggcaaccct aaactacctg 
ttgcacagtt taaacagcga ggaggagcgc 
cttaacctga tgcgcgacgg ggtaacgccc 
atggaaccgg gcatgtatgc ctcaaaccgg 
catcgcgcgg ccgccgtgaa ccccgagtat 
ctaccgcccc ctggtttcta caccggggga 
ctctgggacg acatagacga cagcgtgttt 
caacagcgcg agcaggcaga ggcggcgctg 
ttgtccgatc taggcgctgc ggccccgcgg 
atagggtctc ttaccagcac tcgcaccacc 
ctaaacaact cgctgctgca gccgcagcgc 
aacgggatag agagcctagt ggacaagatg 
agggacgtgc caggcccgcg cccgcccacc 
ctggtgtggg aggacgatga ctcggcagac 
ggcaacccgt ttgcgcacct tcgccccagg 
atgatgcaaa ataaaaaact caccaaggcc 
cccttagtat gcggcgcgcg gcgatgtatg 
tggtgagcgc ggcgccagtg gcggcggcgc 
cgccgtttgt gcctccgcgg tacctgcggc 
ctgagttggc acccctattc gacaccaccc 
atgtggcatc cctgaactac cagaacgacc 
acaatgacta cagcccgggg gaggcaagca 
actggggcgg cgacctgaaa accatcctgc 
tgtttaccaa taagtttaag gcgcgggtga 
aggtggagct gaaatacgag tgggtggagt 
ccatgaccat agaccttatg aacaacgcga 
agaacggggt tctggaaagc gacatcgggg 
ggtttgaccc cgtcactggt cttgtcatgc 
cagacatcat tttgctgcca ggatgcgggg 
tgttgggcat ccgcaagcgg caacccttcc 
tggagggtgg taacattccc gcactgttgg 
atgacaccga acagggcggg ggtggcgcag 
aagagaactc caacgcggca gccgcggcaa 
ccattcgcgg cgacaccttt gccacacggg 
cggccgaagc tgccgccccc gctgcgcaac 
tgatcaaacc cctgacagag gacagcaaga 
gcaccttcac ccagtaccgc agctggtacc 
gaatccgctc atggaccctg ctttgcactc 
actggtcgtt gccagacatg atgcaagacc 
gcaactttcc ggtggtgggc gccgagctgt 
accaggccgt ctactcccaa ctcatccgcc 
gctttcccga gaaccagatt ttggcgcgcc 
aaaacgttcc tgctctcaca gatcacggga 
tccagcgagt gaccattact gacgccagac 
tgggcatagt ctcgccgcgc gtcctatcga 
ttatatcgcc cagcaataac acaggctggg 
gggccaagaa gcgctccgac caacacccag 
ggggcgcgca caaacgcggc cgcactgggc 
tggtggagga ggcgcgcaac tacacgccca 
ccattcagac cgtggtgcgc ggagcccggc 
gcgtagcacg tcgccaccgc cgccgacccg 



atgatgcaag acgcaacgga cccggcggtg 12240 
cttaactcca cggacgactg gcgccaggtc 12300 
aatcctgacg cgttccggca gcagccgcag 12360 
gtggtcccgg cgcgcgcaaa ccccacgcac 124'20 
gccgaaaaca gggccatccg gcccgacgag 12480 
cgcgtggctc gttacaacag cggcaacgtg 12540 
gtgcgcgagg ccgtggcgca gcgtgagcgc 12600 
gttgcactaa acgccttcct gagtacacag 12660 
tacaccaact ttgtgagcgc actgcggcta 12720 
taccagtctg ggccagacta ttttttccag 12780 
ctgagccagg ctttcaaaaa cttgcagggg 12840 
cgcgcgaccg tgtctagctt gctgacgccc 12900 
cccttcacgg acagtggcag cgtgtcccgg 12960 
taccgcgagg ccataggtca ggcgcatgtg 13020 
gtcagccgcg cgctggggca ggaggacacg 13080 
ctgaccaacc ggcggcagaa gatcccctcg 13140 
attttgcgct acgtgcagca gagcgtgagc 13200 
agcgtggcgc tggacatgac cgcgcgcaac 13260 
ccgtttatca accgcctaat ggactacttg 13320 
ttcaccaatg ccatcttgaa cccgcactgg 13380 
ttcgaggtgc ccgagggtaa cgatggattc 134 40 
tccccgcaac cgcagaccct gctagagttg 13500 
cgaaaggaaa gcttccgcag gccaagcagc 13560 
tcagatgcta gtagcccatt tccaagcttg 13620 
cgcccgcgcc tgctgggcga ggaggagtac 13680 
gaaaaaaacc tgcctccggc atttcccaac 13740 
agtagatgga agacgtacgc gcaggagcac 13800 
cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
gacagcagcg tcctggattt gggagggagt 13920 
ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atggcaccga gcgttggttt tcttgtattc 14040 
aggaaggtcc tcctccctcc tacgagagtg 14100 
tgggttctcc cttcgatgct cccctggacc 14160 
ctaccggggg gagaaacagc atccgttact 14220 
gtgtgtacct ggtggacaac aagtcaacgg 14280 
acagcaactt tctgaccacg gtcattcaaa 14340 
cacagaccat caatcttgac gaccggtcgc 14400 
ataccaacat gccaaatgtg aacgagttca 144 60 
tggtgtcgcg cttgcctact aaggacaatc 14520 
tcacgctgcc cgagggcaac tactccgaga 14580 
tcgtggagca ctacttgaaa gtgggcagac 14 640 
taaagtttga cacccgcaac ttcagactgg 14700 
ctggggtata tacaaacgaa gccttccatc 14760 
tggacttcac ccacagccgc ctgagcaact 14820 
aggagggctt taggatcacc tacgatgatc 14880 
atgtggacgc ctaccaggcg agcttgaaag 14940 
gcggcagcaa cagcagtggc agcggcgcgg 15000 
tgcagccggt ggaggacatg aacgatcatg 15060 
ctgaggagaa gcgcgctgag gccgaagcag 15120 
ccgaggtcga gaagcctcag aagaaaccgg 15180 
aacgcagtta caacctaata agcaatgaca 15240 
ttgcatacaa ctacggcgac cctcagaccg 15300 
ctgacgtaac ctgcggctcg gagcaggtct 15360 
ccgtgacctt ccgctccacg cgccagatca 15420 
tgcccgtgca ctccaagagc ttctacaacg 15480 
agtttacctc tctgacccac gtgttcaatc 1554 0 
cgccagcccc caccatcacc accgtcagtg 15600 
cgctaccgct gcgcaacagc atcggaggag 15660 
gccgcacctg cccctacgtt tacaaggccc 15720 
gccgcacttt ttgagcaagc atgtccatcc 15780 
gcctgcgctt cccaagcaag atgtttggcg 15840 
tgcgcgtgcg cgggcactac cgcgcgccct 15900 
gcaccaccgt cgatgacgcc atcgacgcgg 15960 
cgccgccacc agtgtccaca gtggacgcgg 16020 
gctatgctaa aatgaagaga cggcggaggc 16080 
gcactgccgc ccaacgcgcg gcggcggccc 16140 
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tgcttaaccg 

ccgcgggtan 

cggccattag 

ttagcggcct 

actacttaga 

ccaagcgcaa 

cgaagaagga 

aagatgatga 

gacgggtaca 

tctttacgcc 

gcgacgagga 

ggcataagga 

ccgtaacact 

agcgcgagtc 

tggaagatgt 

ggccaatcaa 

ctaccagtag 

ttgcctcagc 

ctacggaggt 

gttcgaggaa 

ttgcgcctac 

gacgccgaac 

cgatttccgt 

gctaccaccc 

cctgccgcct 

tggccggcca 

cgcaccgtcg 

ttggcgccgt 

caagttgcat 

taactatttt 

cgcccgttca 

agctggggct 

agcaaggcct 

ttccaacaaa 

aaccaggcag 

cctccaccgg 

gacagggaag 

aagcaaggcc 

cacacacccg 

ccaggcccga 

agcggtccgc 

atcgtgggtc 

tcgtatgtgt 

gcccgctttc 

cgggccagga 

agacgtactt 

tgaccacaga 

ctgcgtactc 

tggcttccac 

actctggcac 

atgaagctgc 

aagtagacga 

gtataaatat 

ccgataaaac 

ttaatcatgc 

catatgcaaa 

gaaagctaga 

atggtgataa 

cagacactca 

gccaacaatc 

taatgtatta 

atgctgttgt 

ccattggtga 

cagatgttag 

cactgggagg 

aaaatggatg 



cgcacgtcgc 
tgtcactgtg 
tgctatgact 
gcgcgtgccc 
ctcgtactgt 
aatcaaagaa 
agagcaggat 
tgatgaactt 
gtggaaaggt 
cggtgagcgc 
cctgcttgag 
catgctggcg 
gcagcaggtg 
tggtgacttg 
cttggaaaaa 
gcaggtggcg 
caccagtatt 
ggtggcggat 
gcaaacggac 
gtacggcgcc 
ccccggctat 
caccactgga 
gcgcagggtg 
cagcatcgtt 
ccgtttcccg 
cggcctgacg 
catgcgcggc 
gcccggaatt 
gtggaaaaat 
gtagaatgga 
tgggaaactg 
cgctgtggag 
ggaacagcag 
aggtggtaga 
tgcaaaataa 
ccgtggagac 
aaactctggt 
tgcccaccac 
taacgctgga 
ccgccgttgt 
gatcgttgcg 
tgggggtgca 
gtcatgtatg 
caagatggct 
cgcctcggag 
cagcctgaat 
ccggtcccag 
gtacaaggcg 
gtactttgac 
tgcctacaac 
tactgctctt 
gcaagctgag 
tacaaaggag 
atttcaacct 
agctgggaga 
acccacaaat 
aagtcaagtg 
cttgactcct 
tatttcttac 
tatgcccaac 
caacagcacg 
agatttgcaa 
tagaaccagg 
aattattgaa 
tgtgattaat 
ggaaaaagat 



accggccgac 
ccccccaggt 
cagggtcgca 
gtgcgcaccc 
tgtatgtatc 
gagatgctcc 
tacaagcccc 
gacgacgagg 
cgacgcgtaa 
tccacccgca 
caggccaacg 
ttgccgctgg 
ctgcccgcgc 
gcacccaccg 
atgaccgtgg 
ccgggactgg 
gccaccgcca 
gccgcggtgc 
ccgtggatgt 
gccagcgcgc 
cgtggctaca 
acccgccgcc 
gctcgcgaag 
taaaagccgg 
gtgccgggat 
ggcggcatgc 
ggtatcctgc 
gcatccgtgg 
caaaataaaa 
agacatcaac 
gcaagatatc 
cggcattaaa 
cacaggccag 
tggcctggcc 
gattaacagt 
agtgtctcca 
gacgcaaata 
ccgtcccatc 
cctgcctccc 
tgtaacccgt 
gcccgtagcc 
atccctgaag 
cgtccatgtc 
accccttcga 
tacctgagcc 
aacaagttta 
cgtttgacgc 
cggttcaccc 
atccgcggcg 
gccctggctc 
gaaataaacc 
cagcaaaaaa 
ggtattcaaa 
gaacctcaaa 
gtccttaaaa 
gaaaatggag 
gaaatgcaat 
aaagtggtat 
atgcccacta 
aggcctaatt 
ggtaatatgg 
gacagaaaca 
tacttttcta 
aatcatggaa 
acagagactc 
gctacagaat 



gggcggccat 
ccaggcgacg 
ggggcaacgt 
gccccccgcg 
cagcggcggc 
aggtcatcgc 
gaaagctaaa 
tggaactgct 
aacgtgtttt 
cctacaagcg 
agcgcctcgg 
acgagggcaa 
ttgcaccgtc 
tgcagctgat 
aacctgggct 
gcgtgcagac 
cagagggcat 
aggcggtcgc 
ttcgcgtttc 
tactgcccga 
cctaccgccc 
gccgtcgccg 
gaggcaggac 
tctttgtggt 
tccgaggaag 
gtcgtgcgca 
ccctccttat 
ccttgcaggc 
agtctggact 
tttgcgtctc 
ggcaccagca 
aatttcggtt 
atgctgaggg 
tctggcatta 
aagcttgatc 
gaggggcgtg 
gacgagcctc 
gcgcccatgg 
cccgccgaca 
cctagccgcg 
agtggcaact 
cgccgacgat 
gccgccagag 
tgatgccgca 
ccgggctggt 
gaaaccccac 
tgcggttcat 
tagctgtggg 
tgctggacag 
ccaagggtgc 
tagaagaaga 
ctcacgtatt 
taggtgtcga 
taggagaatc 
agactacccc 
ggcaaggcat 
ttttctcaac 
tgtacagtga 
ttaaggaagg 
acattgcttt 
gtgttctggc 
cagagctttc 
tgtggaatca 
ctgaagatga 
ttaccaaggt 
tttcagataa 



gcgggccgct 
agcggccgcc 
gtattgggtg 
caactagatt 
ggcgcgcaac 
gccggagatc 
gcgggtcaaa 
gcacgctacc 
gcgacccggc 
cgtgtatgat 
ggagtttgcc 
cccaacacct 
cgaagaaaag 
ggtacccaag 
ggagcccgag 
cgtggacgtt 
ggagacacaa 
tgcggccgcg 
agccccccgg 
atatgcccta 
cagaagacga 
tcgccagccc 
cctggtgctg 
tcttgcagat 
aatgcaccgt 
ccaccggcgg 
tccactgatc 
gcagagacac 
ctcacgctcg 
tggccccgcg 
atatgagcgg 
ccaccgttaa 
ataagttgaa 
gcggggtggt 
cccgccctcc 
gcgaaaagcg 
cctcgtacga 
ctaccggagt 
cccagcagaa 
cgtccctgcg 
ggcaaagcac 
gcttctgaat 
gagctgctga 
gtggtcttac 
gcagtttgcc 
ggtggcgcct 
ccctgtggac 
tgataaccgt 
gggccctact 
cccaaatcct 
ggacgatgac 
tgggcaggcg 
aggtcaaaca 
tcagtggtac 
aatgaaacca 
tcttgtaaag 
tactgaggcg 
agatgtagat 
taactcacga 
tagggacaat 
gggccaagca 
ataccagctt 
ggctgttgac 
acttccaaat 
aaaacctaaa 
aaatgaaata 



cgaaggctgg 
gcagcagccg 
cgcgactcgg 
gcaagaaaaa 
gaagctatgt 
tatggccccc 
aagaaaaaga 
gcgcccaggc 
accaccgtag 
gaggtgtacg 
tacggaaagc 
agcctaaagc 
cgcggcctaa 
cgccagcgac 
gtccgcgtgc 
cagataccca 
acgtccccgg 
tccaagacct 
cgcccgcgcg 
catccttcca 
gcaactaccc 
gtgctggccc 
ccaacagcgc 
atggccctca 
aggaggggca 
cggcgcgcgt 
gccgcggcga 
tgattaaaaa 
cttggtcctg 
acacggctcg 
tggcgccttc 
gaactatggc 
agagcaaaat 
ggacctggcc 
cgtagaggag 
tccgcgcccc 
ggaggcacta 
gctgggccag 
acctgtgctg 
ccgcgccgcc 
actgaacagc 
agctaacgtg 
gccgccgcgc 
atgcacatct 
cgcgccaccg 
acgcacgacg 
cgtgaggata 
gtgctggaca 
tttaagccct 
tgcgaatggg 
aacgaagacg 
ccttattctg 
cctaaatatg 
gaaactgaaa 
tgttacggtt 
caacaaaatg 
accgcaggca 
atagaaaccc 
gaactaatgg 
tttattggtc 
tcgcagttga 
ttgcttgatt 
agctatgatc 
tactgctttc 
acaggtcagg 
agagttggaa 



16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
X9260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
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ataattttgc catggaaatc aatctaaatg 
acatagcgct gtatttgccc gacaagctaa 
ataacccaaa cacctacgac tacatgaaca 
gctacattaa ccttggagca cgctggtccc 
accaccaccg caatgctggc ctgcgctacc 
tgcccttcca catccaggtg ccccagaagt 
cgggctcata cacctacgag tggaacttca 
ccctaggaaa tgacctaagg gttgacggag 
acgccacctt cttccccatg gcccacaaca 
acgacaccaa cgaccagtcc tttaacgact 
tacccgccaa cgctaccaac gtgcccatat 
gcggctgggc cttcacgcgc cttaagacta 
acccttatta cacctactct ggctctatac 
acacctttaa gaaggtggcc attacctttg 
gcctgcttac ccccaacgag tttgaaatta 
ttgcccagtg taacatgacc aaagactggt 
ttggctacca gggcttctat atcccagaga 
gaaacttcca gcccatgagc cgtcaggtgg 
aggtgggcat cctacaccaa cacaacaact 
tgcgcgaagg acaggcctac cctgctaact 
ttgacagcat tacccagaaa aagtttcttt 
ccagtaactt tatgtccatg ggcgcactca 
actccgccca cgcgctagac atgacttttg 
tttatgtttt gtttgaagtc tttgacgtgg 
tcgaaaccgt gtacctgcgc acgcccttct 
agcaacatca acaacagctg ccgccatggg 
caaagatctt ggttgtgggc catatttttt 
tgtttctcca cacaagctcg cctgcgccat 
cgtacactgg atggcctttg cctggaaccc 
ctttggcttt tctgaccagc gactcaagca 
gcgccgtagc gccattgctt cttcccccga 
aagcgtacag gggcccaact cggccgcctg 
ctttgccaac tggccccaaa ctcccatgga 
ggtacccaac tccatgctca acagtcccca 
acagctctac agcttcctgg agcgccactc 
taggagcgcc acttcttttt gtcacttgaa 
tttcaataaa ggcaaatgct tttatttgta 
tgccgtctgc gccgtttaaa aatcaaaggg 
cagggacacg ttgcgatact ggtgtttagt 
cggcagctcg gtgaagtttt cactccacag 
gtcgggcgcc gatatcttga agtcgcagtt 
atacacaggg ttgcagcact ggaacactat 
gctcttgtcg gagatcagat ccgcgtccag 
caactttggt agctgccttc ccaaaaaggg 
ccgtagtggc atcaaaaggt gaccgtgccc 
aaaagccttg atctgcttaa aagccacctg 
gcaagacttg ccggaaaact gattggccgg 
gtcggtgttg gagatctgca ccacatttcg 
gctagactgc tccttcagcg cgcgctgccc 
gtgctcctta tttatcataa tgcttccgtg 
gcagcggtgc agccacaacg cgcagcccgt 
aaacgactgc aggtacgcct gcaggaatcg 
ggtgaaggtc agctgcaacc cgcggtgctc 
cagagcttcc acttggtcag gcagtagttt 
gtacttgtcc atcagcgcgc gcgcagcctc 
cacactcagc gggttcatca ccgtaatttc 
ctcttgcgtc cgcataccac gcgccactgg 
cttacctcct ttgccatgct tgattagcac 
cgccacatct tctctttctt cctcgctgtc 
gggcttggga gaagggcgct tctttttctt 
ggtcgatggc cgcgggctgg gtgtgcgcgg 
gtcctcggac tcgatacgcc gcctcatccg 
cgacggggac ggggacgaca cgtcctccat 
gcgctcgggg gtggtttcgc gctgctcctc 
gcagaaaaag atcatggagt cagtcgagaa 
cgccaccacc gcctccaccg atgccgccaa 



ccaacctgtg gagaaatttc ctgtactcca 20160 
agtacagtcc ttccaacgta aaaatttctg 20220 
agcgagtggt ggctcccggg ttagtggact 20280 
ttgactatat ggacaacgtc aacccattta 20340 
gctcaatgtt gctgggcaat ggtcgctatg 20400 
tctttgccat taaaaacctc cttctcctgc 204 60 
ggaaggatgt taacatggtt ctgcagagct 20520 
ccagcattaa gtttgatagc atttgccttt 20580 
ccgcctccac gcttgaggcc atgcttagaa 20640 
atctctccgc cgccaacatg ctctacccta 20700 
ccatcccctc ccgcaactgg gcggctttcc 20760 
aggaaacccc atcactgggc tcgggctacg 20820 
cctacctaga tggaaccttt tacctcaacc 20880 
actcttctgt cagctggcct ggcaatgacc 20940 
agcgctcagt tgacggggag ggttacaacg 21000 
tcctggtaca aatgctagct aactacaaca 21060 
gctacaagga ccgcatgtac tccttcttta 21120 
tggatgatac taaatacaag gactaccaac 21180 
ctggatttgt tggctacctt gcccccacca 21240 
tcccctatcc gcttataggc aagaccgcag 21300 
gcgatcgcac cctttggcgc atcccattct 21360 
cagacctggg ccaaaacctt ctctacgcca 21420 
aggtggatcc catggacgag cccacccttc 21480 
tccgtgtgca ccggccgcac cgcggcgtca 21540 
cggccggcaa cgccacaaca taaagaagca 21600 
ctccagtgag caggaactga aagccattgt 21660 
gggcacctat gacaagcgct ttccaggctt 21720 
agtcaatacg gccggtcgcg agactggggg 21780 
gcactcaaaa acatgctacc tctttgagcc 21840 
ggtttaccag tttgagtacg agtcactcct 21900 
ccgctgtata acgctggaaa agtccaccca 21960 
tggactattc tgctgcatgt ttctccacgc 22020 
tcacaacccc accatgaacc ttattaccgg 22080 
ggtacagccc accctgcgtc gcaaccagga 22140 
gccctacttc cgcagccaca gtgcgcagat 22200 
aaacatgtaa aaataatgta ctagagacac 22260 
cactctcggg tgattattta cccccaccct 22320 
gttctgccgc gcatcgctat gcgccactgg 22380 
gctccactta aactcaggca caaccatccg 22440 
gctgcgcacc atcaccaacg cgtttagcag 22500 
ggggcctccg ccctgcgcgc gcgagttgcg 22560 
cagcgccggg tggtgcacgc tggccagcac 22620 
gtcctccgcg ttgctcaggg cgaacggagt 22680 
cgcgtgccca ggctttgagt tgcactcgca 2274 0 
ggtctgggcg ttaggataca gcgcctgcat 22800 
agcctttgcg ccttcagaga agaacatgcc 22860 
acaggccgcg tcgtgcacgc agcaccttgc 22920 
gccccaccgg ttcttcacga tcttggcctt 22980 
gttttcgctc gtcacatcca tttcaatcac 23040 
tagacactta agctcgcctt cgatctcagc 23100 
gggctcgtga tgcttgtagg tcacctctgc 23160 
ccccatcatc gtcacaaagg tcttgttgct 23220 
ctcgttcagc caggtcttgc atacggccgc 23280 
gaagttcgcc tttagatcgt tatccacgtg 23340 
catgcccttc tcccacgcag acacgatcgg 23400 
actttccgct tcgctgggct cttcctcttc 23460 
gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cggtgggttg ctgaaaccca ccatttgtag 23580 
cacgattacc tctggtgatg gcgggcgctc 23640 
cttgggcgca atggccaaat ccgccgccga 23700 
caccagcgcg tcttgtgatg agtcttcctc 23760 
cttttttggg ggcgcccggg gaggcggcgg 23820 
ggttggggga cgtcgcgccg caccgcgtcc 23880 
ttcccgactg gccatttcct tctcctatag 23940 
gaaggacagc ctaaccgccc cctctgagtt 24000 
cgcgcctacc accttccccg tcgaggcacc 24060 
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cccgcttgag gaggaggaag tgattatcga 
cgaggaccgc tcagtaccaa cagaggataa 
cgaggaacaa gtcgggcggg gggacgaaag 
cgtgctgttg aagcatctgc agcgccagtg 
cagcgatgtg cccctcgcca tagcggatgt 
accgcgcgta ccccccaaac gccaagaaaa 
cttctacccc gtatttgccg tgccagaggt 
ctgcaagata cccctatcct gccgtgccaa 
gcggcagggc gctgtcatac ctgatatcgc 
gggtcttgga cgcgacgaga agcgcgcggc 
tgaaagtcac tctggagtgt tggtggaact 
aaaacgcagc atcgaggtca cccactttgc 
catgagcaca gtcatgagtg agctgatcgt 
aaatttgcaa gaacaaacag aggagggcct 
ctggcttcaa acgcgcgagc ctgccgactt 
agtgctcgtt accgtggagc ttgagtgcat 
gcgcaagcta gaggaaacat tgcactacac 
caagatctcc aacgtggagc tctgcaacct 
ccgccttggg caaaacgtgc ttcattccac 
ccgcgactgc gtttacttat ttctatgcta 
gcagtgctrg gaggagtgca acctcaagga 
ggacctatgg acggccttca acgagcgctc 
ccccgaacgc ctgcttaaaa ccctgcaaca 
gttgcagaac tttaggaact ttatcctaga 
tgcacttcct agcgactttg tgcccattaa 
ccactgctac cttctgcagc tagccaacta 
cgtgagcggt gacggtctac tggagtgtca 
ctccctggtt tgcaattcgc agctgcttaa 
gcagggtccc tcgcctgacg aaaagtccgc 
gtggacgtcg gcttaccttc gcaaatttgt 
gttctacgaa gaccaatccc gcccgccaaa 
gggccacatt cttggccaat tgcaagccat 
aaagggacgg ggggtttact tggaccccca 
gccgccgcag ccctatcagc agcagccgcg 
agaagctgca gctgccgccg ccacccacgg 
aggaggtttt ggacgaggag gaggaggaca 
aagcttccga ggtcgaagag gtgtcagacg 
cgccggcgcc ccagaaatcg gcaaccggtt 
cgccgccggc actgcccgtt cgccgaccca 
ccggtaagtc caagcagccg ccgccgttag 
gctcatggcg cgggcacaag aacgccatag 
tctccttcgc ccgccgcttt cttctctacc 
tgcattacta ccgtcatctc tacagcccat 
acagcagcgg ccacacagaa gcaaaggcga 
aaatccacag cggcggcagc agcaggagga 
gtatcgaccc gcgagcttag aaacaggatt 
agcaggggcc aagaacaaga gctgaaaata 
agctgcctgt atcacaaaag cgaagatcag 
ctcttcagta aatactgcgc gctgactctt 
taagcgcgaa aactacgtca tctccagcgg 
gccattatga gcaaggaaat tcccacgccc 
cttgcggctg gagctgccca agactactca 
cacatgatat cccgggtcaa cggaatccgc 
gcggctatta ccaccacacc tcgtaataac 
gtgtaccagg aaagtcccgc tcccaccact 
gttcagatga ctaactcagg ggcgcagctt 
cccgggcagg gtataactca cctgacaatc 
tcggtgagct cctcgcttgg tctccgtccg 
cgtccttcat tcacgcctcg tcaggcaatc 
cgctctggag gcattggaac tctgcaattt 
aaccccttct cgggacctcc cggccactat 
gtaaaggact cggcggacgg ctacgactga 
ctgaaacacc tggtccactg tcgccgccac 
tgctactttg aattgcccga ggatcatatc 
gcccagggag agcttgcccg tagcctgatt 
gagcgggaca ggggaccctg tgttctcact 



gcaggaccca ggttttgtaa gcgaagacga 24120 
aaagcaagac caggacaacg cagaggcaaa 24180 
gcatggcgac tacctagatg tgggagacga 24240 
cgccattatc tgcgacgcgt tgcaagagcg 24300 
cagccttgcc tacgaacgcc acctattctc 24360 
cggcacatgc gagcccaacc cgcgcctcaa 24420 
gcttgccacc tatcacatct ttttccaaaa 24480 
ccgcagccga gcggacaagc agctggcctt 24 540 
ctcgctcaac gaagtgccaa aaatctttga 24 600 
aaacgctctg caacaggaaa acagcgaaaa 24 660 
cgagggtgac aacgcgcgcc tagccgtact 24 720 
ctacccggca cttaacctac cccccaaggt 24 780 
gcgccgtgcg cagcccctgg agagggatgc 24840 
acccgcagtt ggcgacgagc agctagcgcg 24 900 
ggaggagcga cgcaaactaa tgatggccgc 24960 
gcagcggttc tttgctgacc cggagatgca 25020 
ctttcgacag ggctacgtac gccaggcctg 25080 
ggtcrcctac cttggaattt tgcacgaaaa 25140 
gctcaagggc gaggcgcgcc gcgactacgt 25200 
cacctggcag acggccatgg gcgtttggca 25260 
gctgcagaaa ctgctaaagc aaaacttgaa 25320 
cgtggccgcg cacctggcgg acatcatttt 25380 
gggtctgcca gacttcacca gtcaaagcat 25440 
gcgctcagga atcttgcccg ccacctgctg 25500 
gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccttgcctac cactctgaca taatggaaga 25620 
ctgtcgctgc aacctatgca ccccgcaccg 25680 
cgaaagtcaa attatcggta cctttgagct 25740 
ggctccgggg ttgaaactca ctccggggct 25800 
acctgaggac taccacgccc acgagattag 25860 
tgcggagctt accgcctgcg tcattaccca 25920 
caacaaagcc cgccaagagt ttctgctacg 25980 
gtccggcgag gagctcaacc caatcccccc 26040 
ggcccttgct tcccaggatg gcacccaaaa 26100 
acgaggagga atactgggac agtcaggcag 26160 
tgatggaaga ctgggagagc ctagacgagg 26220 
aaacaccgtc accctcggtc gcattcccct 26280 
ccagcatggc tacaacctcc gctcctcagg 26340 
accgtagatg ggacaccact ggaaccaggg 26400 
cccaagagca acaacagcgc caaggctacc 26460 
ttgcttgctt gcaagactgt gggggcaaca 26520 
atcacggcgt ggccttcccc cgtaacatcc 26580 
actgcaccgg cggcagcggc agcggcagca 2664 0 
ccggatagca agactctgac aaagcccaag 26700 
ggagcgctgc gtctggcgcc caacgaaccc 26760 
tttcccactc tgtatgctat atttcaacag 26820 
aaaaacaggt ctctgcgatc cctcacccgc 2 6880 
cttcggcgca cgctggaaga cgcggaggct 2 6940 
aaggactagt ttcgcgccct ttctcaaatt 27000 
ccacacccgg cgccagcacc tgtcgtcagc 27060 
tacatgtgga gttaccagcc acaaatggga 27120 
acccgaacaa actacatgag cgcgggaccc 27180 
gcccaccgaa accgaattct cttggaacag 27240 
cttaatcccc gtagttggcc cgctgccctg 27300 
gtggtacttc ccagagacgc ccaggccgaa 27360 
gcgggcggct ttcgtcacag ggtgcggtcg 27420 
agagggcgag gtattcagct caacgacgag 27480 
gacgggacat ttcagatcgg cggcgccggc 27540 
ctaactctgc agacctcgtc ctctgagccg 27600 
attgaggagt ttgtgccatc ggtctacttt 27660 
ccggatcaat ttattcctaa ctttgacgcg 27720 
atgttaagtg gagaggcaga gcaactgcgc 27780 
aagtgctttg cccgcgactc cggtgagttt 27840 
gagggcccgg cgcacggcgt ccggcttacc 27900 
cgggagttta cccagcgccc cctgctagtt 27960 
gtgatttgca actgtcctaa ccttggatta 28020 
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catcaagatc tttgttgcca tctctgtgct gagtataata aatacagaaa ttaaaatata 28080 
ctggggctcc tatcgccatc ctgtaaacgc caccgtcttc acccgcccaa gcaaaccaag 28140 
gcgaacctta cctggtactt ttaacatctc tccctctgtg atttacaaca gtttcaaccc 28200 
agacggagtg agtctacgag agaacctctc cgagctcagc tactccatca gaaaaaacac 28260 
caccctcctt acctgccggg aacgtacgag tgcgtcaccg gccgctgcac cacacctacc 28320 
gcctgaccgt aaaccagact ttttccggac agacctcaat aactctgttt accagaacag 28380 
gaggtgagct tagaaaaccc ttagggtatt aggccaaagg cgcagctact gtggggttta 28440 
tgaacaattc aagcaactct acgggctatt ctaattcagg tttctctagg gttggggtta 28500 
ttctctgtct tgtgattctc tttattctta tactaacgct tctctgccta aggctcgccg 28560 
cctgctgtgt gcacatttgc atttattgtc agctttttaa acgctggggt cgccacccaa 28620 
gatgattagg tacataatcc taggtttact cacccttgcg tcagcccacg gtaccaccca 28680 
aaaggtggat tttaaggagc cagcctgtaa tgttacattc gcagctgaag ctaatgagtg 28740 
caccactctt ataaaatgca ccacagaaca tgaaaagctg cttattcgcc acaaaaacaa 28800 
aattggcaag tatgctgttt atgctatttg gcagccaggt gacactacag agtataatgt 28860 
tacagttttc cagggtaaaa gtcataaaac ttttatgtat acttttccat tttatgaaat 28920 
gtgcgacatt accatgtaca tgagcaaaca gtataagttg tggcccccac aaaattgtgt 28980 
ggaaaacact ggcactttct gctgcactgc tatgctaatt acagtgctcg ctttggtctg 29040 
taccctactc tatattaaat acaaaagcag acgcagcttt attgaggaaa agaaaatgcc 29100 
ttaatttact aagttacaaa gctaatgtca ccactaactg ctttactcgc tgcttgcaaa 29160 
acaaattcaa aaagttagca ttataattag aataggattt aaaccccccg gtcatttcct 29220 
gctcaatacc attcccctga acaattgact ctatgtggga tatgctccag cgctacaacc 29280 
ttgaagtcag gcttcctgga tgtcagcatc tgactttggc cagcacctgt cccgcggatt 29340 
tgttccagtc caactacagc gacccaccct aacagagatg accaacacaa ccaacgcggc 29400 
cgccgctacc ggacttacat ctaccacaaa tacaccccaa gtttctgcct ttgtcaataa 29460 
ctgggataac ttgggcatgt ggtggttctc catagcgctt atgtttgtat gccttattat 29520 
tatgtggctc atctgctgcc taaagcgcaa acgcgcccga ccacccatct atagtcccat 29580 
cattgtgcta cacccaaaca atgatggaat ccatagattg gacggactga aacacatgtt 29640 
cttttctctt acagtatgat taaatgagac atgattcctc gagtttttat attactgacc 29700 
cttgttgcgc ttttttgtgc gtgctccaca ttggctgcgg tttctcacat cgaagtagac 29760 
tgcattccag ccttcacagt ctatttgctt tacggatttg tcaccctcac gctcatctgc 29820 
agcctcatca ctgtggtcat cgcctttatc cagtgcattg actgggtctg tgtgcgcttt 29880 
gcatatctca gctgctgcca tgttgtgttg ctaccatgtt gttttcatgt gttgctgcca 29940 
tgctcttgtc gccttagatc tctctttatg tagtgttgtg gtgtctctct tgtcgtgatg 30000 
tgtgttttgt cctatatatt ttaattttta atccaaaccc ctgtccccgc agaggccttt 30060 
gcgttctggt aggccgtcat tgaaaactga cttaactcgt taaattaaaa aaatgtaaaa 30120 
aataatggtt gagactcagc ccaacatcgg cagatgaggt ggattgagac tcagcccaac 30180 
atcggcagat gaggtggatt gagactcaac cccaacattg gcagatgagg tgaattagat 30240 
gaggtggatt gagactcatg agggtggtat gagggcccga cgtccacagg tgggagttgt 30300 
gctttacagt ccaacgtgca ggacgcttgg catttgccag agaacaccaa gattggcaaa 30360 
ttcgcaactg gcgccctgtg ctcttcacag acggaaaaat gaccaaaatc tgattatttt 30420 
tgtaaaacgg aaaccgaatg tccgacaaag ttcatttgat gacttcccgg taggtctgcc 304 80 
ctgccgctgg gccgacgccg tccgggaatt ttacaaacga tttcggacgt ctagcattca 30540 
ctcaccttgt caaggacctg aggatctctg cacccttatt aagaccctgt gcggtctcaa 30600 
agatcttatt ccctttaact aataaaaaaa aataataaag catcacttac ttaaaatcag 30660 
ttagcaaatt tctgtccagt ttattcagca gcacctcctt gccctcctcc cagctctggt 30720 
attgcagctt cctcctggct gcaaactttc tccacaatct aaatggaatg tcagtttcct 30780 
cctgttcctg tccatccgca cccactatct tcatgttgtt gcagatgaag cgcgcaagac 30840 
cgtctgaaga taccttcaac cccgtgtatc catatgacac ggaaaccggt cctccaactg 30900 
tgccttttct tactcctccc tttgtatccc ccaatgggtt tcaagagagt ccccctgggg 30960 
tactctcttt gcgcctatcc gaacctctag ttacctccaa tggcatgctt gcgctcaaaa 31020 
tgggcaacgg cctctctctg gacgaggccg gcaaccttac ctcccaaaat gtaaccactg 31080 
tgagcccacc tctcaaaaaa accaagtcaa acataaacct ggaaatatct gcacccctca 31140 
cagttacctc agaagcccta actgtggctg ccgccgcacc tctaatggtc gcgggcaaca 31200 
cactcaccat gcaatcacag gccccgctaa ccgtgcacga ctccaaactt agcattgcca 31260 
cccaaggacc cctcacagtg tcagaaggaa agctagccct gcaaacatca ggccccctca 31320 
ccaccaccga tagcagtacc cttactatca ctgcctcacc ccctctaact actgccactg 31380 
gtagcttggg cattgacttg aaagagccca tttatacaca aaatggaaaa ctaggactaa 31440 
agtacggggc tcctttgcat gtaacagacg acctaaacac tttgaccgta gcaactggtc 31500 
caggtgtgac tattaataat acttccttgc aaactaaagt tactggagcc ttgggttttg 31560 
attcacaagg caatatgcaa cttaatgtag caggaggact aaggattgat tctcaaaaca 31620 
gacgccttat acttgatgtt agttatccgt ttgatgctca aaaccaacta aatctaagac 31680 
taggacaggg ccctcttttt ataaactcag cccacaactt ggatattaac tacaacaaag 31740 
gcctttactt gtttacagct tcaaacaatt ccaaaaagct tgaggttaac ctaagcactg 31800 
ccaaggggtt gatgtttgac gctacagcca tagccattaa tgcaggagat gggcttgaat 31860 
ttggttcacc taatgcacca aacacaaatc ccctcaaaac aaaaattggc catggcctag 31920 
aatttgattc aaacaaggct atggttccta aactaggaac tggccttagt tttgacagca 31980 
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caggtgccat 

ctccatctcc 

caaaatgtgg 

ctccaatatc 

tgctactaaa 

ctgaaggcac 

aatctcacgg 

aaactaaacc 

ctccaagtgc 

aaatatttgc 

gtgttatgtt 

agtagtatag 

agaaccctag 

ccccggctgg 

ttccacacgg 

agctcactca 

ggttgcttaa 

tgcatcagga 

tccgtcctgc 

agcataaggc 

cagtaactgc 

ccaaagctca 

attaagtggc 

taattcacca 

atcctaaacc 

gaacaatgac 

tcaatgttgg 

cgcgttagaa 

cagggaagac 

agcggatgat 

ctactgtacg 

ggaacgccgg 

tctgcgtctc 

tctcaaagca 

tgccctgata 

ctgcgagtca 

ccaaaagatt 

tggcgtggtc 

tggcttccaa 

ggtgaatctc 

gccaccttct 

tctgctccag 

aggttcctca 

ccgtaggtcc 

ggccacttcc 

cggagctatg 

aatgcaaggt 

agtcatgctc 

tttttctctc 

catttaaaca 

gactacggcc 

cgacagctcc 

attcatcggt 

gtagagacaa 

cataaacacc 

catacagcgc 

ttaaaaaaac 

agtgcgttac 

gcttggggca 

acctcagtgt 

gcaggaacac 

ttgccccctg 

caagcctcca 

gggcatctgg 

gccccctcta 

aagccctgga 



tacagtagga 
taactgtaga 
cagtcaaata 
tggaacagtt 
caattccttc 
agcctataca 
taaaactgcc 
tgtaacacta 
atactctatg 
cacatcctct 
tcaacgtgtt 
ccccaccacc 
tattcaacct 
ccttaaaaag 
tttcctgtcg 
agttcatgtc 
cgggcggcga 
tagggcggtg 
aggaatacaa 
gccttgtcct 
agcacagcac 
tggcggggac 
gacccctcat 
cctcccggta 
agctggccaa 
agtggagagc 
cacaacacag 
ccatatccca 
ctcgcacgta 
cctccagtat 
gagtgcgccg 
acgtagtcat 
cggtctcgcc 
tccaggcgcc 
acatccacca 
cacacgggag 
atccaaaacc 
aaactctaca 
aaggcaaacg 
ctctataaac 
caatatatct 
agcgccctcc 
cagacctgta 
cttcgcaggg 
ccgccaggaa 
ctaaccagcg 
gctgctcaaa 
atgcagataa 
aaacatgtct 
ttagaagcct 
atgccggcgt 
tcggtcatgt 
cagtgctaaa 
cattacagcc 
tgaaaaaccc 
ttcacagcgg 
accactcgac 
actgcagcag 
tggcccctta 
ttgtctttgc 
tcctgcctgc 
cccagactcc 
tacctggtcc 
ttgggggcag 
ctcttgagca 
agggcagacc 



aacaaaaata 
ctaaatgcag 
cttgctacag 
caaagtgctc 
ctggacccag 
aacgctgttg 
aaaagtaaca 
accattacac 
tcattttcat 
tacacttttt 
tatttttcaa 
acatagctta 
gccacctccc 
catcatatca 
agccaaacgc 
gctgtccagc 
aggagaagtc 
gtgctgcagc 
catggcagtg 
ccgggcacag 
cacaatattg 
cacagaaccc 
aaacacgctg 
ccatataaac 
aacctgcccg 
ccaggactcg 
gcacacgtgc 
gggaacaacc 
actcacgttg 
ggtagcgcgg 
agacaaccga 
atttcctgaa 
gcttagatcg 
ccctggcttc 
ccgcagaata 
gagcgggaag 
tcaaaatgaa 
gccaaagaac 
gccctcacgt 
attccagcac 
ctaagcaaat 
accttcagcc 
taagattcaa 
ccagctgaac 
ccttgacaaa 
tagccccgat 
aaatcaggca 
aggcaggtaa 
gcgggtttct 
gtcttacaac 
gaccgtaaaa 
ccggagtcat 
aagcgaccga 
cccataggag 
tcctgcctag 
cagcctaaca 
acggcaccag 
gtgtgactca 
tagctgggcg 
tctgaagagc 
cttaccacct 
catgttcctg 
cacctctcca 
cctgggtgtt 
atgctcttga 
caggacactc 



atgauaagct 
agaaagatgc 
ttrcagtttt 
atcttattat 
aatattggaa 
gatttatgcc 
ttgtcagtca 
taaacggtac 
gggactggtc 
catacattgc 
ttgcagaaaa 
tacagatcac 
tcccaacaca 
tgggtaacag 
tcatcagtga 
tgctgagcca 
cacgcctaca 
agcgcgcgaa 
gtctcctcag 
cagcgcaccc 
ttcaaaatcc 
acgtggccat 
gacataaaca 
ctctgattaa 
ccggctatac 
taaccatgga 
atacacttcc 
cattcctgaa 
tgcattgtca 
gtttctgtct 
gatcgtgttg 
gcaaaaccag 
ctctgtgtag 
gggttctatg 
agccacaccc 
agctggaaga 
gatctattaa 
agataatggc 
ccaagtggac 
cttcaaccat 
cccgaatatt 
tcaagcagcg 
aagcggaaca 
ataatcgtgc 
agaacccaca 
gtaagctttg 
aagcctcgcg 
gctccggaac 
gcataaacac 
aggaaaaaca 
aaactggtca 
aatgtaagac 
aatagcccgg 
gtataacaaa 
gcaaaatagc 
gtcagcctta 
ctcaatcagt 
gccatggcac 
gggcgtgggg 
cctccaggtg 
gtcctggctg 
gacttgtggc 
ggccgtggga 
cccctcccat 
gagcttcctg 
tcaccacctc 



aactttgtgg 
taaactcact 
ggctgttaaa 
aagatttgac 
ctttagaaat 
taacctatca 
agtttactta 
acaggaaaca 
tggccacaac 
ccaagaataa 
tttcaagtca 
cgtaccttaa 
cagagtacac 
acatattctt 
tattaataaa 
caggctgctg 
tgggggtaga 
taaactgctg 
cgatgattcg 
tgatctcact 
cacagtgcaa 
cataccacaa 
ttacctcttt 
acatggcgcc 
actgcaggga 
tcatcatgct 
tcaggattac 
tcagcgtaaa 
aagtgttaca 
caaaaggagg 
gtcgtagtgt 
gtgcgggcgt 
tagttgtagt 
taaactcctt 
agccaaccta 
accatgtttt 
gtgaacgcgc 
atttgtaaga 
gtaaaggcta 
gcccaaataa 
aagtccggcc 
aatcatgatt 
ttaacaaaaa 
aggtctgcac 
ctgattatga 
ttgcatgggc 
caaaaaagaa 
caccacagaa 
aaaataaaat 
acccttataa 
ccgtgattaa 
tcggtaaaca 
gggaatacat 
attaatagga 
accctcccgc 
ccagtaaaaa 
cacagtgtaa 
ctctgcagcc 
gctctgtagg 
cttgatccca 
atggcctgtt 
ttcctccaac 
gggaggttga 
cccctccctg 
cctggctctt 
cttacctttt 



accacaccag 
ttggtcttaa 
ggcagtttgg 
gaaaatggag 
ggagatctta 
gcttatccaa 
aacggagaca 
ggagacacaa 
tacattaatg 
agaatcgttt 
tttttcattc 
tcaaactcac 
agtcctttct 
aggtgttata 
ctccccgggc 
tccaacttgc 
gtcataatcg 
ccgccgccgc 
caccgcccgc 
taaatcagca 
ggcgctgtat 
gcgcaggtag 
tggcatgttg 
atccaccacc 
accgggactg 
cgtcatgata 
aagctcctcc 
tcccacactg 
ttcgggcagc 
tagacgatcc 
catgccaaat 
gacaaacaga 
atatccactc 
catgcgccgc 
cacattcgtt 
tttttttatt 
tcccctccgg 
tgttgcacaa 
aacccttcag 
ttctcatctc 
attgtaaaaa 
gcaaaaattc 
taccgcgatc 
ggaccagcgc 
cacgcatact 
ggcgatataa 
agcacatcgt 
aaagacacca 
aacaaaaaaa 
gcataagacg 
aaagcaccac 
catcaggttg 
acccgcaggc 
gagaaaaaca 
tccagaacaa 
agaaaaccta 
aaaagggcca 
tgggtaccct 
agtggcagcg 
ccttttccca 
cctgcctcct 
caggggctct 
ggagggtgga 
ggcctcccag 
aacccagggc 
cccctggaaa 



32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32520 
32580 
32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240 
33300 
33360 
33420 
33480 
33540 
33600 
33660 
33720 
33780 
33840 
33900 
33960 
34020 
34080 
34140 
34200 
34260 
34320 
34380 
34440 
34500 
34560 
34620 
34680 
34740 
34800 
34860 
34920 
34980 
35040 
35100 
35160 
35220 
35280 
35340 
35400 
35460 
35520 
35580 
35640 
35700 
35760 
35820 
35880 
35940 
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aatcttctgt atacttccca ttttaagaaa actacaattc ccaacacata caagttactc 36000 
cgccctaaaa cctacgtcac ccgccccgtt cccacgcccc gcgccacgtc acaaactcca 36060 
ccccctcatt atcatattgg cttcaatcca aaataaggta tattattgat gatg 36114 

<210> 17 
<211> 40 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 17 

Met Thr Gly Ser Thr lie Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
15 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 

Val Asn Asp Trp Ala Ser Leu Asp 
35 40 



<210> 18 
<211> 19 
<212> PRT 

<213> Adenovirus subgroup C 

<400> 18 

Met Trp Trp Phe Ser lie Ala Leu Met Phe Val Cys Leu He He Met 
1 5 10 15 

Trp Leu He 



<210> 19 
<211> 8 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 19 

Lys Arg Arg Arg Ala Arg Pro Pro 
1 5 



<210> 20 
<211> 42 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 20 

Cys Cys Leu Lys Arg Arg Arg Ala Arg Pro Pro He Tyr Arg Pro He 
15 10 15 

He Val Leu Asn Pro His Asn Glu Lys He His Arg Leu Asp Gly Leu 
20 25 30 

Lys Pro Cys Ser Leu Leu Leu Gin Tyr Asp 
35 40 



